(54) Title: A METHOD FOR IDENTIFYING A SYNTHETIC MOLECULE HAVING AFFINITY TOWARDS A TARGET



(57) Abstract: The present invention relates to a method for identifying a synthetic molecule having affinity towards a target. The
method includes the steps of providing a library of bifunctional complexes, wherein each complex of the library comprises a synthetic
molecule attached to an identifier, which codes for said molecule; subjecting, under binding conditions, the library of bifunctional
complexes to a target; removing the non-binding members of the library; separating the identifiers of complexes comprising synthetic
molecules having affinity towards the target, and decoding the identifiers to establish the identity of the molecule.

A METHOD FOR IDENTIFYING A SYNTHETIC MOLECULE HAVING AFFINITY
TOWARDS A TARGET



This application is a non-provisional of U.S. provisional application Serial No.
60/483,899 filed on 2 July 2003, which is hereby incorporated by reference in its
entirety. All patent and non-patent references cited in the application, or in the
present application, are also hereby incorporated by reference in their entirety.

Technical Field of the Invention 

The present invention relates to a method for identifying from a library a synthetic
molecule having affinity towards a target. The synthetic molecule has initially been a
part of a complex also comprising an identifier that codes for said synthetic
molecule. The invention further relates to a library of complexes, which in a certain
embodiment can be used in the method. The experiments supporting the present
method suggest an enrichment of more than a million times, e.g. a specific synthetic
molecule may be identified in a library of 10^6 or more complexes.

Background 

Traditional drug discovery begins with a pathological phenomenon in an organism
and the development of a therapeutic theory to combat this. A chemical concept
follows to produce compounds for screening. Most of the processes for curing the
pathological phenomenon originate with the understanding of some biological
pathways and screening for an effect in tissues or cells. This may or may not eventually
reveal a "target". The target can be identified by various conventional methods,
including protein expression, protein chemistry, structure-functional studies,
knowledge of biochemical pathways, and genetic studies.



In recent years, genetic information has increasingly guided the identification of
single molecular targets. These are derived from the knowledge of the genes of
specific cell phenotypes that encode proteins that may be involved in the pathogenesis
of a particular disease state.

A lead is a compound, usually a small organic molecule, that demonstrates a desired
biological activity on a target. Usually, a collection of compounds, referred to as a
"library", is screened before a useful lead is identified. Today, many libraries are
commercially available or open to the public. Most pharmaceutical companies house
their own compilation of compounds that have been synthesised over several years
and screened against a variety of targets.

Each compound in a library must be screened by an appropriate assay against the
target. Techniques for handling the screening of several thousand compounds
simultaneously have been developed and are generally referred to as high-throughput
screening techniques. To push the limit of compounds possible to screen
simultaneously, different manufacturers have been developing instrumentation capable of
handling multiple microtiter plate formats on the same platform using 384- and
1536-well plates. Advances in small volume liquid dispensing and pipetting, reliable
handling of standardized plates and simplified assay formats all have made an impact
on the reliability of the high-throughput screening process.

However, high-throughput screening has the disadvantage that each of the
compounds has to be positioned in a spatially discrete region, usually in a well of a
microtiter plate, in order to observe an interaction with a target. If more than a single
compound is present, it is not feasible to discern which compound displays the
appropriate biological activity. Thus, the full power of combinatorial chemistry cannot be
applied because a collection of compounds usually is produced in a single
container.

To be able to select a possible lead compound in a collection of compounds placed
in the same container, libraries of bifunctional complexes have been developed. Each
bifunctional complex in the library comprises a potential lead compound coupled to
an identifier sequence. The identifier sequence is suitably a nucleic acid which
uniquely identifies the potential lead compound. When a library of bifunctional
complexes is screened against a target, one or more of the potential leads may bind to
the target. After removal of the remainder of the library, the binding bifunctional
complexes can be eluted and the lead compound identified by sequencing the
identifier.

Various techniques for producing bifunctional complexes are known from the prior
art. Some attempts to form the complex comprising a molecule as well as the
identifier that codes therefor are based on the split-and-mix principle known from
combinatorial chemistry, see e.g. WO 93/06121 A1, EP 643 778 B1, and WO 00/23458.
Other attempts have focussed on the formation of encoded proteins using the
natural machinery of a cell and connecting the formed protein with the template nucleic
acid that has coded for the amino acid components of the protein. Examples of
suitable systems are phage display, E. coli display, ribosome display (WO 93/03172),
and protein-mRNA-fusions (WO 98/31700). The genetic information of the nucleic
acid, usually mRNA or DNA, need not necessarily be decoded between each round
of selection to establish the identity of the chemical entities that have formed the
protein because the nucleic acid can be amplified by known means, such as PCR,
and processed for the formation of a new library enriched in respect of suitable
binding proteins.

Recently, a method for encoding molecules has been suggested, which can be 
performed in several selection rounds without intermediate decoding, wherein the 
encoded molecule is not restricted to peptides and proteins. WO 02/00419 and WO 
02/103008 disclose methods for preparing virtually any molecule connected to an 
identifier coding for chemical entities which have reacted to form the molecule. In 
short, a template segregated into a plurality of codons and a plurality of building 
blocks comprising a transferable chemical entity and an anticodon are initially pro- 
vided. Under hybridisation conditions, the template and building blocks are annealed 
together and the chemical entities are subsequently reacted to form the molecule. 

The present invention aims at providing an efficient method for identifying molecules 
having affinity towards a target using a library of bifunctional complexes. 



Summary of the Invention 

The present invention concerns a method for identifying a synthetic molecule having 
affinity towards a target, comprising the steps of 

a) providing a library of bifunctional complexes, wherein each complex of 
the library comprises a synthetic molecule attached to an identifier, which codes for 
said molecule, 

b) subjecting, under binding conditions, the library of bifunctional
complexes to a target,

c) removing the non-binding members of the library,

d) separating the identifiers of complexes comprising synthetic molecules
having affinity towards the target, and

e) decoding the identifiers to establish the identity of the molecule.
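
Purely as an illustration, and not as part of the disclosed method, steps a) to e) can be
sketched as a toy computation. The identifier sequences, the molecule names, the binding
test and the function name select_and_decode are all hypothetical; a real selection is a
laboratory procedure and the binding step is not a simple predicate.

```python
# Toy model of steps a) to e); every name and sequence here is hypothetical.
from typing import Callable, Dict, List, Tuple

Complex = Tuple[str, str]  # (identifier sequence, synthetic molecule name)


def select_and_decode(
    library: List[Complex],
    binds_target: Callable[[str], bool],
    codebook: Dict[str, str],
) -> List[str]:
    """Return the molecules whose identifiers survive the selection."""
    # b) and c): expose the library to the target and discard the non-binders.
    bound = [c for c in library if binds_target(c[1])]
    # d): separate the identifiers from the bound complexes.
    identifiers = [identifier for identifier, _ in bound]
    # e): decode each identifier back to the molecule it codes for.
    return [codebook[identifier] for identifier in identifiers]


# a) a three-member library in which each identifier codes for one molecule.
library = [("ACGT", "mol-1"), ("GGCA", "mol-2"), ("TTAG", "mol-3")]
codebook = {identifier: molecule for identifier, molecule in library}

hits = select_and_decode(library, lambda m: m == "mol-2", codebook)
print(hits)
```

The sketch only makes the bookkeeping explicit: the molecule is never inspected directly
in step e); its identity is recovered from the identifier alone.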

The present invention offers a novel method for enrichment of libraries of
bifunctional complexes. In certain aspects, the method takes advantage of retaining the
non-specific binding complexes bound to the target, walls of wells, beads etc. while
the identifiers of the complexes having specific affinity towards the target are
separated. Thus, in certain aspects the present invention provides a method that does
not merely select those complexes which survive a selection assay, since the non-specific
binding complexes are excluded from the pool of separated identifiers.

While it is not desired to be bound by any particular theory, it is presently believed
that the identifier part of the complexes possesses an inherent affinity towards the
surfaces of a reaction chamber or parts of the target not involved in the binding of an
agonist or antagonist. The separation step ensures that the non-specific binding
complexes are retained in the vessel. The identifiers attached to the synthetic
molecule may be separated by cleaving different kinds of linkages. Preferably the linkage
is selectively cleavable, i.e. when the media comprising the library of complexes is
exposed to a certain condition, only the intended linkage is cleaved.



In one embodiment of the invention, a cleavable linker moiety is situated between
the synthetic molecule and the identifier. When the complex is subjected to
conditions cleaving the linker, only such identifiers that are not adhered to solid surfaces
etc. are liberated into the liquid media. The identifiers in the liquid media may easily
be recovered, thereby separating these from the non-specific binding identifiers
remaining immobilized. In another embodiment of the invention, a cleavable linker is
positioned between a target and a solid surface, thereby immobilizing the target.
Subsequent to exposing the immobilized target to a library of complexes and
removing the non-binding part of the library, the complexes attached to the target may
be separated by exposing the media to a condition cleaving the linkage between the
target and the solid surface. If desirable the identifier can be isolated and amplified
using standard molecular methods, such as PCR.

The second embodiment may be combined with chromatography. Following the
separation of the complexes attached to the target, the media may be subjected to
chromatography, e.g. size exclusion chromatography. In certain embodiments, after
the chromatography process another orthogonal cleavable linkage between the
synthetic molecule and the identifier may be cleaved to liberate the identifier.

The invention also relates to a library of complexes, in which each different complex
comprises a synthetic molecule, attached via a cleavable linkage to an identifier
which codes for said molecule. The linkage may be cleaved by a variety of different
conditions, such as electromagnetic radiation, chemical agents, and enzymes. In a
preferred aspect the selective cleavable linkage comprises a chemical moiety, which
can be cleaved by electromagnetic irradiation, such as light having a specified
wavelength. A useful chemical moiety for the cleavable linkage comprises a group

[structural formula not reproduced; the recoverable labels are NO2, R1, R2, R3 and X]

in which R1 and R2 are either of the synthetic molecule or the
identifier, respectively, R3 is H or OCH3, and X is O, S, or NH. The identifiers
preferably comprise a sequence of nucleotides. In one aspect of the invention, the
identifier comprises 2 or more codons, which code for 2 or more chemical entities
incorporated into the synthetic molecule.






Complex

The complex of the present invention comprises a synthetic molecule and an
identifier. The identifier comprises identifying moieties that identify the molecule.
Preferably, the identifier identifies the molecule uniquely, i.e. in a library of complexes a
particular identifier is capable of distinguishing the molecule it is attached to from the
rest of the molecules.



The molecule and the identifier may be attached directly to each other or through a
bridging moiety. In one aspect of the invention, the bridging moiety is a selectively
cleavable linkage. The identifying moieties of each complex suitably comprise
recognition units, i.e. units which may be recognized by a detecting entity. A variety of
different kinds of recognition exist in nature. Examples include antibodies which
recognize an epitope, proteins which recognize another protein, mRNA which
recognizes a protein, small molecules (like biotin) which recognize a protein (like avidin or
streptavidin) and oligonucleotides which recognize complementary
oligonucleotides. Generally it is preferred that the identifier is a sequence of nucleotides.



The method may in certain embodiments be performed without amplification after
the separation step. However, when larger libraries are used and the amount of
separated identifiers is relatively low, it is in general preferred to use an identifier
which is amplifiable. Identifiers comprising a sequence of nucleotides may be
amplified using standard techniques, like PCR. In the event the identifier is a protein, the
protein may be amplified by attaching the mRNA which encoded the synthesis
thereof, generating the cDNA from the mRNA and subjecting said mRNA to a
translation system. Such a system is described in WO 98/31700, the content of which
is incorporated herein by reference. An alternative method for amplifying a protein is
to use phage-displayed proteins.

The identifier may comprise two or more codons. The sequence of codons can be
decoded to identify reactants used in the formation of the molecule. When the
identifier comprises more than one codon, each member of a pool of building blocks can
be identified uniquely and the order of codons is informative of the synthesis step
each member has been incorporated in.
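
As a minimal sketch of such decoding, assuming codons of a fixed length of four
nucleotides arranged without framing sequences, an identifier can be split into codons and
each codon looked up in a codebook. The codebook entries, the codon length and the
function name decode_identifier are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Tuple

CODON_LENGTH = 4  # assumed fixed codon length for this sketch

# Hypothetical codebook: codon -> reactant name.
CODEBOOK: Dict[str, str] = {
    "ACGT": "reactant-A",
    "GGCA": "reactant-B",
    "TTAG": "reactant-C",
}


def decode_identifier(identifier: str) -> List[Tuple[int, str]]:
    """Split an identifier into codons and return (synthesis step, reactant) pairs."""
    if len(identifier) % CODON_LENGTH != 0:
        raise ValueError("identifier length is not a multiple of the codon length")
    codons = [
        identifier[i : i + CODON_LENGTH]
        for i in range(0, len(identifier), CODON_LENGTH)
    ]
    # The position of a codon gives the synthesis step of its reactant.
    return [(step, CODEBOOK[codon]) for step, codon in enumerate(codons, start=1)]


history = decode_identifier("GGCAACGTTTAG")
print(history)
```

Here the first codon is read as the reactant of the first synthesis step, the second
codon as that of the second step, and so on.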


The sequence of the nucleotides in each codon may have any suitable length. The 
codon may be a single nucleotide or a plurality of nucleotides. In some aspects of 
the invention, it is preferred that each codon independently comprises four or more 
nucleotides, more preferred 4 to 30 nucleotides. 

The identifier will in general have at least two codons arranged in sequence, i.e.
next to each other. Two neighbouring codons may be separated by a framing
sequence. Depending on the encoded molecule formed, the identifier may comprise
further codons, such as 3, 4, 5, or more codons. Each of the further codons may be
separated by a suitable framing sequence. Preferably, all or at least a majority of the
codons of the identifier are separated from a neighbouring codon by a framing
sequence. The framing sequence may have any suitable number of nucleotides, e.g. 1
to 20. Alternatively, codons on the identifier may be designed with overlapping
sequences.
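
The layout with framing sequences can be illustrated by a short parsing sketch. It
assumes, hypothetically, that each codon has four nucleotides and is preceded by a known
two-nucleotide framing sequence that differs from position to position; the sequences in
FRAMES and the function name split_framed_identifier are illustrative assumptions only.

```python
from typing import List

CODON_LENGTH = 4  # assumed codon length
# Hypothetical framing sequences; the frame upstream of a codon marks its position.
FRAMES = ["CC", "GG", "CG"]


def split_framed_identifier(identifier: str) -> List[str]:
    """Return the codons of an identifier laid out as frame, codon, frame, codon, ..."""
    codons = []
    offset = 0
    for position, frame in enumerate(FRAMES, start=1):
        if offset == len(identifier):
            break  # the identifier has fewer codons than there are known frames
        if not identifier.startswith(frame, offset):
            raise ValueError(f"expected frame {frame!r} before codon {position}")
        offset += len(frame)
        codon = identifier[offset : offset + CODON_LENGTH]
        if len(codon) != CODON_LENGTH:
            raise ValueError(f"codon {position} is truncated")
        codons.append(codon)
        offset += CODON_LENGTH
    if offset != len(identifier):
        raise ValueError("unexpected trailing sequence")
    return codons


codons = split_framed_identifier("CCACGTGGTTAG")
print(codons)
```

A read that starts out of frame fails the frame check instead of yielding shifted codons.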



The framing sequence, if present, may serve various purposes. In one setup of the
invention, the framing sequence identifies the position of the codon. Usually, the
framing sequence either upstream or downstream of a codon comprises information
which allows determination of the position of the codons. In another setup, the
frames have alternating sequences, allowing for addition of building blocks from two
pools in the formation of the library. The framing sequence may also or in addition
provide for a region of high affinity. The high affinity region may ensure that the
hybridisation of the template with an anti-codon will occur in frame. Moreover, the
framing sequence may adjust the annealing temperature to a desired level.

A framing sequence with high affinity can be provided by incorporation of one or
more nucleobases forming three hydrogen bonds to a cognate nucleobase.
Examples of nucleobases having this property are guanine and cytosine. Alternatively, or
in addition, the framing sequence may be subjected to backbone modification.
Several backbone modifications provide for higher affinity, such as 2'-O-methyl
substitution of the ribose moiety, peptide nucleic acids (PNA), and 2'-4' O-methylene
cyclisation of the ribose moiety, also referred to as LNA (Locked Nucleic Acid).
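
The effect of guanine and cytosine content on the annealing temperature can be
illustrated with the Wallace rule, a common rule of thumb for short unmodified DNA
oligonucleotides that estimates the melting temperature as 2 °C per A or T plus 4 °C per
G or C. The rule is not taken from the present description, it does not account for
backbone modifications such as PNA or LNA, and the function name wallace_tm is chosen
here only for illustration.

```python
def wallace_tm(sequence: str) -> int:
    """Estimate the melting temperature in deg C as 2*(A+T) + 4*(G+C)."""
    seq = sequence.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if at + gc != len(seq):
        raise ValueError("sequence must contain only A, C, G and T")
    return 2 * at + 4 * gc


# Two hypothetical six-nucleotide framing sequences.
at_rich = wallace_tm("ATATAT")
gc_rich = wallace_tm("GCGCGC")
print(at_rich, gc_rich)
```

Under this rough estimate the GC-only sequence anneals at a higher temperature than an
AT-only sequence of the same length, which is the sense in which a framing sequence can
shift the annealing temperature.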

The identifier may comprise flanking regions around the codons. The flanking region
can encompass a signal group, such as a fluorophore or a radioactive group to allow
for detection of the presence or absence of a complex, or the flanking region may
comprise a label that may be detected, such as biotin. When the identifier comprises
a biotin moiety, the identifier may easily be recovered following the separation step.

The flanking regions can also serve as priming sites for amplification reactions, such
as PCR. The identifier may in certain embodiments comprise an affinity region
having the property of being able to hybridise to a building block.

It is to be understood that when the term identifier is used in the present description
and claims, the identifier may be in the sense or the anti-sense format, i.e. the
identifier can be a sequence of codons which actually codes for the molecule or can be a
sequence complementary thereto. Moreover, the identifier may be single-stranded
or double-stranded, as appropriate.

The synthetic molecule part of the complex is generally of a structure expected to
have an effect on the target. When the target is of pharmaceutical importance, the
molecule is generally a possible drug candidate. The complex may be formed by
tagging a library of different possible drug candidates with a tag, e.g. a nucleic acid
tag uniquely identifying each possible drug candidate. In another embodiment of the
invention, the molecule is encoded, i.e. formed by a variety of reactants which have
reacted with each other and/or a scaffold molecule. Optionally, this reaction product
may be post-modified to obtain the final molecule displayed on the complex. The
post-modification may involve the cleavage of one or more chemical bonds
attaching the encoded molecule to the identifier in order to display the
encoded molecule more efficiently.

The formation of an encoded molecule generally starts with a scaffold, i.e. a chemical
unit having one or more reactive groups capable of forming a connection to another
reactive group positioned on a chemical entity, thereby generating an addition to the
original scaffold. A second chemical entity may react with a reactive group also
appearing on the original scaffold or a reactive group incorporated by the first
chemical entity. Further chemical entities may be involved in the formation of the
final reaction product. The formation of a connection between the chemical entity
and the nascent encoded molecule may be mediated by a bridging molecule. As an
example, if the nascent encoded molecule and the chemical entity both comprise an
amine group, a connection between these can be mediated by a dicarboxylic acid. A
synthetic molecule is in general produced in vitro and may be a naturally occurring
or an artificial substance. Usually, a synthetic molecule is not produced using the
natural translation system in an in vitro process.

The chemical entities that are precursors for structural additions or eliminations of
the encoded molecule may be attached to a building block prior to the participation
in the formation of the reaction product leading to the final encoded molecule. Besides
the chemical entity, the building block generally comprises an anti-codon. In some
embodiments the building blocks also comprise an affinity region providing for
affinity towards the nascent complex.

Thus, the chemical entities are suitably mediated to the nascent encoded molecule
by a building block, which further comprises an anticodon. The anti-codon serves



WO 2005/003778 PCT/DK2004/000466 


the function of transferring the genetic information of the building block in
conjunction with the transfer of a chemical entity. The transfer of genetic information
and chemical entity may occur in any order; however, it is important that a
correspondence is maintained in the complex. The chemical entities are preferably
reacted without enzymatic interaction in some aspects of the invention. Notably, the
reaction of the chemical entities is preferably not mediated by ribosomes or
enzymes having similar activity. In another aspect of the invention a ribosome is
used to translate an mRNA into a protein using a tRNA loaded with a natural or
unnatural amino acid.

According to certain aspects of the invention the genetic information of the
anti-codon is transferred by specific hybridisation to a codon on a nucleic acid template.
Another method for transferring the genetic information of the anti-codon to the
nascent complex is to anneal an oligonucleotide complementary to the anti-codon
and attach this oligonucleotide to the complex, e.g. by ligation. A still further method
involves transferring the genetic information of the anti-codon to the nascent
complex by an extension reaction using a polymerase and a mixture of dNTPs.



The chemical entity of the building block may in most cases be regarded as a
precursor for the structural entity eventually incorporated into the encoded molecule; in
other cases the chemical entity provides for the elimination of chemical units of the
nascent encoded molecule. Therefore, when it is stated in the present application and
claims that a chemical entity is transferred to a nascent encoded molecule, it is to
be understood that not necessarily all the atoms of the original chemical entity are to
be found in the eventually formed encoded molecule. Also, as a consequence of the
reactions involved in the connection, the structure of the chemical entity can be
changed when it appears on the nascent encoded molecule. In particular, the
cleavage resulting in the release of the entity may generate a reactive group which in a
subsequent step can participate in the formation of a connection between a nascent
complex and a chemical entity.

The chemical entity of the building block comprises at least one reactive group
capable of participating in a reaction which results in a connection between the
chemical entity of the building block and another chemical entity or a scaffold associated
with the nascent complex. The number of reactive groups which appear on the
chemical entity is suitably one to ten. A building block featuring only one reactive
group is used i.a. in the end positions of polymers or scaffolds, whereas building
blocks having two reactive groups are suitable for the formation of the body part of a
polymer or scaffolds capable of being reacted further. One, two or more reactive
groups intended for the formation of connections are typically present on scaffolds.
Non-limiting examples of scaffolds are opiates, steroids, benzodiazepines,
hydantoins, and peptidylphosphonates.

The reactive group of the chemical entity may be capable of forming a direct
connection to a reactive group of the nascent complex or the reactive group of the
building block may be capable of forming a connection to a reactive group of the
nascent complex through a bridging fill-in group. It is to be understood that not all
the atoms of a reactive group are necessarily maintained in the connection formed.
Rather, the reactive groups are to be regarded as precursors for the structure of the
connection.

The subsequent cleavage step to release the chemical entity from the building block
can be performed in any appropriate way. In an aspect of the invention the cleavage
involves usage of a chemical reagent or an enzyme. The cleavage results in a
transfer of the chemical entity to the nascent encoded molecule or in a transfer of
the nascent encoded molecule to the chemical entity of the building block. In some
cases it may be advantageous to introduce new chemical groups as a consequence
of linker cleavage. The new chemical groups may be used for further reaction in a
subsequent cycle, either directly or after having been activated. In other cases it is
desirable that no trace of the linker remains after the cleavage.

In another aspect, the connection and the cleavage are conducted as a simultaneous
reaction, i.e. either the chemical entity of the building block or the nascent encoded
molecule is a leaving group of the reaction. In some aspects of the invention, it is
appropriate to design the system such that the connection and the cleavage occur
simultaneously because this will reduce the number of steps and the complexity.
The simultaneous connection and cleavage can also be designed such that either
no trace of the linker remains or such that a new chemical group for further reaction
is introduced, as described above.



The attachment of the chemical entity to the building block, optionally via a suitable
spacer, can be at any entity available for attachment, e.g. the chemical entity can be
attached to a nucleobase or the backbone. In general, it is preferred to attach the
chemical entity at the phosphorus of the internucleoside linkage or at the nucleobase.
When the nucleobase is used for attachment of the chemical entity, the attachment
point is usually at the 7 position of the purines or 7-deaza-purines or at the 5 position
of pyrimidines. The nucleotide may be distanced from the reactive group of the
chemical entity by a spacer moiety. The spacer may be designed such that the
conformational space sampled by the reactive group is optimized for a reaction
with the reactive group of the nascent encoded molecule.

The synthetic molecules of the invention may have any chemical structure. In a
preferred aspect, the synthetic molecule can be any compound that may be
synthesized in a component-by-component fashion. In some aspects the synthetic
molecule is a linear or branched polymer. In another aspect the synthetic molecule
is a scaffolded molecule. The term "synthetic molecule" also comprises naturally
occurring molecules like α-polypeptides etc., however produced in vitro, usually in the
absence of enzymes, like ribosomes. In certain aspects, the synthetic molecule of
the library is a non-α-polypeptide.






The synthetic molecule may have any molecular weight. However, where the synthetic
molecule is to be orally available, it is preferred that it has a
molecular weight of less than 2000 Daltons, preferably less than 1000 Daltons, and
more preferred less than 500 Daltons.



The size of the library may vary considerably depending on the expected result of the
inventive method. In some aspects, it may be sufficient that the library comprises
two, three, or four different complexes. However, in most events, more than two
different complexes are desired to obtain a higher diversity. In some aspects, the
library comprises 1,000 or more different complexes, more preferred 1,000,000 or
more different complexes. The upper limit for the size of the library is only restricted
by the size of the vessel in which the library is comprised. It may be calculated that a
vial may comprise up to 10^14 different complexes.
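
To give a feel for these numbers, the following sketch computes two quantities under
stated assumptions that are not taken from the description: first, that a library is
built combinatorially so that its size is the product of the number of building blocks
available at each position, and second, that each different complex is present as a
given number of copies. The function names and the pool sizes are illustrative only.

```python
import math
from typing import Sequence

AVOGADRO = 6.02214076e23  # particles per mole


def library_size(pool_sizes: Sequence[int]) -> int:
    """Number of different complexes if every combination of building blocks is formed."""
    return math.prod(pool_sizes)


def moles_of_complexes(n_complexes: int, copies_each: int = 1) -> float:
    """Amount of substance, in moles, of n_complexes present in copies_each copies."""
    return n_complexes * copies_each / AVOGADRO


# Three positions with 100 hypothetical building blocks each.
size = library_size([100, 100, 100])
# Amount corresponding to 10^14 different complexes at one copy each.
amount = moles_of_complexes(10**14)
print(size, amount)
```

Under these assumptions three pools of 100 building blocks already give a library of
1,000,000 different complexes, and 10^14 complexes at one copy each correspond to roughly
0.17 nanomoles of material.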
Methods for forming libraries of complexes

The complexes comprising an identifier having two or more codons that code for
reactants that have reacted in the formation of the molecule part of the complex may
be formed by a variety of processes. Generally, the preferred methods can be used
for the formation of virtually any kind of encoded molecule. Suitable examples of
processes include prior art methods disclosed in WO 93/20242, WO 93/06121, WO
00/23458, WO 02/074929, and WO 02/103008, the content of which is
incorporated herein by reference, as well as methods of the present applicant not yet publicly
available, including the methods disclosed in DK PA 2002 01955 filed 19 December
2002, and DK PA 2003 00430 filed 20 March 2003. Any of these methods may be
used, and the entire content of the patent applications is included herein by
reference.

Below, four preferred embodiments are described. A first embodiment, disclosed in
more detail in WO 02/103008, is based on the use of a polymerase to incorporate
unnatural nucleotides as building blocks. Initially, a plurality of template
oligonucleotides is provided. Subsequently, primers are annealed to each of the
templates and a polymerase extends the primer using nucleotide derivatives
which have appended chemical entities. Subsequent to or simultaneously with the
incorporation of the nucleotide derivatives, the chemical entities are reacted to form
a reaction product. The encoded molecule may be post-modified by cleaving some
of the linking moieties to better present the encoded molecule.

Several possible reaction approaches for the chemical entities are apparent. First,
the nucleotide derivatives can be incorporated and the chemical entities
subsequently polymerised. In the event the chemical entities each carry two reactive
groups, the chemical entities can be attached to adjacent chemical entities by a
reaction of these reactive groups. Exemplary of the reactive groups are amine and
carboxylic acid, which upon reaction form an amide bond. Adjacent chemical entities
can also be linked together using a linking or bridging moiety. Exemplary of this
approach is the linking of two chemical entities each bearing an amine group by a
bi-carboxylic acid. Yet another approach is the use of a reactive group between a
chemical entity and the nucleotide building block, such as an ester or a thioester
group. An adjacent building block having a reactive group such as an amine may
cleave the interspaced reactive group to obtain a linkage to the chemical entity, e.g.
by an amide linking group.



A second embodiment for obtaining complexes pertains to the use of
hybridisation of building blocks to a template and reaction of chemical entities
attached to the building blocks in order to obtain a reaction product. In this approach,
templates are contacted with a plurality of building blocks, wherein
each building block comprises an anti-codon and a chemical entity. The anti-codons
are designed such that they recognise a sequence, i.e. a codon, on the template.
Subsequent to the annealing of the anti-codon and the codon to each other a
reaction of the chemical entity is effected.
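
The recognition of codons by anti-codons can be sketched computationally, assuming for
illustration that an anti-codon is the exact reverse complement of the codon it
recognises and that hybridisation is perfectly specific. The sequences, the entity names
and the names BuildingBlock and anneal are hypothetical; real annealing also depends on
temperature, mismatches and sequence context.

```python
from typing import List, NamedTuple


class BuildingBlock(NamedTuple):
    anticodon: str        # sequence that recognises a codon on the template
    chemical_entity: str  # hypothetical name of the entity carried by the block


_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return sequence.upper().translate(_COMPLEMENT)[::-1]


def anneal(template_codons: List[str], pool: List[BuildingBlock]) -> List[str]:
    """For each codon, in template order, return the entity of the matching block."""
    by_codon = {reverse_complement(b.anticodon): b.chemical_entity for b in pool}
    return [by_codon[codon] for codon in template_codons]


pool = [
    BuildingBlock("TGCC", "amine-1"),  # recognises codon GGCA
    BuildingBlock("CTAA", "acid-2"),   # recognises codon TTAG
    BuildingBlock("TGGT", "acid-3"),   # recognises codon ACCA
]
entities = anneal(["TTAG", "GGCA"], pool)
print(entities)
```

The order of the returned entities follows the order of the codons on the template,
which is what allows the template to direct which entities are brought together.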

The template may be associated with a scaffold. Building blocks carrying chemical
entities may be added sequentially or simultaneously and a reaction of the
reactive group of the chemical entity may be effected at any time after the annealing
of the building blocks to the template.

A third embodiment for the generation of a complex includes chemical or
enzymatic ligation of building blocks when these are lined up on a template.
Initially, templates are provided, each having one or more codons. The templates
are contacted with building blocks comprising anti-codons linked to chemical
entities. The two or more anti-codons annealed on a template are subsequently
ligated to each other and a reaction of the chemical entities is effected to obtain a
reaction product. The method is disclosed in more detail in DK PA 2003 00430 filed
20 March 2003.



A fourth embodiment makes use of the extension by a polymerase of an affinity
sequence of the nascent complex to transfer the anti-codon of a building block to the
nascent complex. In this method, a nascent complex comprising a scaffold
and an affinity region is annealed to a building block comprising a region
complementary to the affinity region. Subsequently the anti-codon region of the
building block is transferred to the nascent complex by a polymerase. The
chemical entity may be transferred prior to, simultaneously with or subsequent
to the transfer of the anti-codon. This method is disclosed in detail in DK PA 2002
01955 filed 19 December 2002.

Thus, the codons are either pre-made into one or more templates before the
encoded molecules are generated or the codons are transferred simultaneously with
the formation of the encoded molecules.

After or simultaneously with the formation of the reaction product some of the linkers
to the template may be cleaved; however, at least one linker must be maintained to
provide for the complex.

Nucleotides

The nucleotides used in the present invention may be linked together in a sequence
of nucleotides, i.e. an oligonucleotide. Each nucleotide monomer is normally
composed of two parts, namely a nucleobase moiety and a backbone. The backbone
may in some cases be subdivided into a sugar moiety and an internucleoside linker.

The nucleobase moiety may be selected among naturally occurring nucleobases as
well as non-naturally occurring nucleobases. Thus, "nucleobase" includes not only
the known purine and pyrimidine heterocycles, but also heterocyclic analogues and
tautomers thereof. Illustrative examples of nucleobases are adenine, guanine,
thymine, cytosine, uracil, purine, xanthine, diaminopurine, 8-oxo-N⁶-methyladenine,
7-deazaxanthine, 7-deazaguanine, N⁴,N⁴-ethanocytosine,
N⁶,N⁶-ethano-2,6-diaminopurine, 5-methylcytosine, 5-(C₃-C₆)-alkynylcytosine,
5-fluorouracil, 5-bromouracil, pseudoisocytosine,
2-hydroxy-5-methyl-4-triazolopyridine, isocytosine, isoguanine, inosine and the
"non-naturally occurring" nucleobases described in Benner et al., U.S. Pat. No.
5,432,272. The term "nucleobase" is intended to cover these examples
as well as analogues and tautomers thereof. Especially interesting nucleobases are
adenine, guanine, thymine, cytosine, 5-methylcytosine, and uracil, which are
considered as the naturally occurring nucleobases in relation to therapeutic and
diagnostic application in humans.

Examples of suitable specific pairs of nucleobases are shown below:

[Structural formulae not reproduced. The drawing comprises three panels: "Natural
Base Pairs" (adenine with uracil, R = H, or thymine, R = CH₃; guanine with
cytosine), "Synthetic Base Pairs", and "Synthetic purine bases pairing with natural
pyrimidines" (7-deaza adenine with uracil, R = H, or thymine, R = CH₃; 7-deaza
guanine with cytosine).]

Suitable examples of backbone units are shown below (B denotes a nucleobase):

[Structural formulae not reproduced. The surviving labels are Thio-LNA, Amino-LNA
(R = -H, -CH₃), 2'-MOE, 2'-Fluoro, 2'-F-ANA, 2'-AP, HNA, CeNA, PNA, Morpholino,
3'-Phosphoramidate, Boranophosphates, TNA and 2'-(3-hydroxy)propyl.]



The sugar moiety of the backbone is suitably a pentose but may be the appropriate
part of a PNA or a six-member ring. Suitable examples of possible pentoses
include ribose, 2'-deoxyribose, 2'-O-methyl-ribose, 2'-fluoro-ribose, and
2'-4'-O-methylene-ribose (LNA). Suitably the nucleobase is attached to the 1'
position of the pentose entity.

An internucleoside linker connects the 3' end of a preceding monomer to the 5' end of a
succeeding monomer when the sugar moiety of the backbone is a pentose, like
ribose or 2'-deoxyribose. The internucleoside linkage may be the naturally occurring
phosphodiester linkage or a derivative thereof. Examples of such derivatives include
phosphorothioate, methylphosphonate, phosphoramidate, phosphotriester, and
phosphodithioate. Furthermore, the internucleoside linker can be any of a number of
non-phosphorous-containing linkers known in the art.

Preferred nucleic acid monomers include naturally occurring nucleosides forming
part of the DNA as well as the RNA family connected through phosphodiester
linkages. The members of the DNA family include deoxyadenosine, deoxyguanosine,
deoxythymidine, and deoxycytidine. The members of the RNA family include
adenosine, guanosine, uridine, cytidine, and inosine. Inosine is a non-specific pairing
nucleoside and may be used as a universal base because inosine can pair nearly
isoenergetically with A, T, and C. Other compounds having the same ability of
non-specifically base-pairing with natural nucleobases have been formed. Suitable
compounds which may be utilized in the present invention include, among others, the
compounds depicted below.

Examples of Universal Bases:

[Structural formulae not reproduced. The surviving labels read MICS, 5MICS and PIM.]

Building block

The chemical entities that are precursors for structural additions or eliminations of
the encoded molecule may be attached to a building block prior to the participation
in the formation of the reaction product leading to the final encoded molecule. Besides
the chemical entity, the building block generally comprises an anti-codon.

The chemical entity of the building block comprises at least one reactive group
capable of participating in a reaction which results in a connection between the
chemical entity of the building block and another chemical entity or a scaffold associated
with the nascent complex. The connection is facilitated by one or more reactive
groups of the chemical entity. The number of reactive groups which appear on the
chemical entity is suitably one to ten. A building block featuring only one reactive
group is used i.a. in the end positions of polymers or scaffolds, whereas building
blocks having two reactive groups are suitable for the formation of the body part of a
polymer or scaffolds capable of being reacted further. One, two or more reactive
groups intended for the formation of connections are typically present on scaffolds.

The reactive group of the building block may be capable of forming a direct
connection to a reactive group of the nascent complex or the reactive group of the building
block may be capable of forming a connection to a reactive group of the nascent
complex through a bridging fill-in group. It is to be understood that not all the atoms
of a reactive group are necessarily maintained in the connection formed. Rather, the
reactive groups are to be regarded as precursors for the structure of the connection.

The subsequent cleavage step to release the chemical entity from the building block
can be performed in any appropriate way. In an aspect of the invention the cleavage
involves usage of a reagent or an enzyme. The cleavage results in a transfer of the
chemical entity to the nascent encoded molecule or in a transfer of the nascent
encoded molecule to the chemical entity of the building block. In some cases it may be
advantageous to introduce new chemical groups as a consequence of linker
cleavage. The new chemical groups may be used for further reaction in a subsequent
cycle, either directly or after having been activated. In other cases it is desirable that
no trace of the linker remains after the cleavage.

In another aspect, the connection and the cleavage are conducted as a simultaneous
reaction, i.e. either the chemical entity of the building block or the nascent encoded
molecule is a leaving group of the reaction. In general, it is preferred to design the
system such that the connection and the cleavage occur simultaneously because
this will reduce the number of steps and the complexity. The simultaneous
connection and cleavage can also be designed such that either no trace of the linker
remains or such that a new chemical group for further reaction is introduced, as
described above.

The attachment of the chemical entity to the building block, optionally via a suitable
spacer, can be at any entity available for attachment, e.g. the chemical entity can be
attached to a nucleobase or the backbone. In general, it is preferred to attach the
chemical entity at the phosphorus of the internucleoside linkage or at the nucleobase.
When the nucleobase is used for attachment of the chemical entity, the attachment
point is usually at the 7 position of the purines or 7-deaza-purines or at the 5 position
of pyrimidines. The nucleotide may be distanced from the reactive group of the
chemical entity by a spacer moiety. The spacer may be designed such that the
conformational space sampled by the reactive group is optimized for a reaction with
the reactive group of the nascent encoded molecule or reactive site.

The anticodon complements the codon of the identifier sequence and generally
comprises the same number of nucleotides as the codon. The anticodon may be
adjoined with a fixed sequence, such as a sequence complementing a framing
sequence.

Various specific building blocks are envisaged. Building blocks of particular interest
are shown below.

Building blocks transferring a chemical entity to a recipient nucleophilic group
The building block indicated below is capable of transferring a chemical entity (CE)
to a recipient nucleophilic group, typically an amine group. The bold lower horizontal
line illustrates the building block and the vertical line illustrates a spacer.

[Structural formula not reproduced.]

The 5-membered substituted N-hydroxysuccinimide (NHS) ring serves as an
activator, i.e. a labile bond is formed between the oxygen atom connected to the NHS ring
and the chemical entity. The labile bond may be cleaved by a nucleophilic group,
e.g. positioned on a scaffold, to transfer the chemical entity to the scaffold, thus
converting the remainder of the fragment into a leaving group of the reaction.

When the chemical entity is connected to the activator through a carbonyl group
and the recipient group is an amine, the bond formed on the scaffold will be an amide
bond. The above building block is the subject of the Danish patent application No.
PA 2002 01946 and the US provisional patent application No. 60/434,439, the
contents of which are incorporated herein in their entirety by reference.

Another building block which may form an amide bond is

[Structural formula not reproduced.]

R may be absent or NO₂, CF₃, or halogen, preferably Cl, Br, or I, and Z may be S or O.
This type of building block is disclosed in Danish patent application No. PA 2002
0951 and US provisional patent application filed 20 December 2002 with the title "A
building block capable of transferring a functional entity to a recipient reactive
group". The contents of both patent applications are incorporated herein in their
entirety by reference.

A nucleophilic group can cleave the linkage between Z and the carbonyl group,
thereby transferring the chemical entity -(C=O)-CE to said nucleophilic group.

Building blocks transferring a chemical entity to a recipient reactive group forming a
C=C bond
A building block as shown below is able to transfer the chemical entity to a
recipient aldehyde group, thereby forming a double bond between the carbon of the
aldehyde and the chemical entity.

[Structural formula not reproduced.]

The above building block is comprised by the Danish patent application No. DK PA
2002 01952 and the US provisional patent application filed 20 December 2002 with
the title "A building block capable of transferring a functional entity to a recipient
reactive group forming a C=C double bond". The contents of both patent applications
are incorporated herein in their entirety by reference.

Building blocks transferring a chemical entity to a recipient reactive group forming a
C-C bond
The below building block is able to transfer the chemical entity to a recipient group,
thereby forming a single bond between the receiving moiety, e.g. a scaffold, and the
chemical entity.

[Structural formula not reproduced.]

The above building block is comprised by the Danish patent application No. DK PA
2002 01947 and the US provisional patent application No 60/434,428. The contents
of both patent applications are incorporated herein in their entirety by reference.

Another building block capable of transferring a chemical entity to a receiving
reactive group forming a single bond is

[Structural formula not reproduced.]

The receiving group may be a nucleophile, such as a group comprising a hetero
atom, thereby forming a single bond between the chemical entity and the hetero
atom, or the receiving group may be an electronegative carbon atom, thereby
forming a C-C bond between the chemical entity and the scaffold.

The chemical entity attached to any of the above building blocks may be selected
from a large arsenal of chemical structures. Examples of chemical entities are
H or entities selected among the group consisting of a C₁-C₆ alkyl, C₂-C₆ alkenyl,
C₂-C₆ alkynyl, C₄-C₈ alkadienyl, C₃-C₇ cycloalkyl, C₃-C₇ cycloheteroalkyl, aryl, and
heteroaryl, said group being substituted with 0-3 R⁴, 0-3 R⁵ and 0-3 R⁹ or
C₁-C₃ alkylene-NR⁴₂, C₁-C₃ alkylene-NR⁴C(O)R⁸, C₁-C₃ alkylene-NR⁴C(O)OR⁸,
C₁-C₂ alkylene-O-NR⁴₂, C₁-C₂ alkylene-O-NR⁴C(O)R⁸, C₁-C₂ alkylene-O-NR⁴C(O)OR⁸
substituted with 0-3 R⁹.

where R⁴ is H or selected independently among the group consisting of
C₁-C₆ alkyl, C₂-C₆ alkenyl, C₂-C₆ alkynyl, C₃-C₇ cycloalkyl, C₃-C₇ cycloheteroalkyl,
aryl, heteroaryl, said group being substituted with 0-3 R⁹ and

R⁵ is selected independently from -N₃, -CNO, -C(NOH)NH₂, -NHOH,
-NHNHR⁶, -C(O)R⁶, -SnR⁶₃, -B(OR⁶)₂, -P(O)(OR⁶)₂ or the group consisting of C₂-C₆
alkenyl, C₂-C₆ alkynyl, C₄-C₈ alkadienyl, said group being substituted with 0-2 R⁷,

where R⁶ is selected independently from H, C₁-C₆ alkyl, C₃-C₇ cycloalkyl,
aryl or C₁-C₆ alkylene-aryl substituted with 0-5 halogen atoms selected from -F,
-Cl, -Br, and -I; and R⁷ is independently selected from -NO₂, -COOR⁶, -COR⁶, -CN,
-OSiR⁶₃, -OR⁶ and -NR⁶₂.

R⁸ is H, C₁-C₆ alkyl, C₂-C₆ alkenyl, C₂-C₆ alkynyl, C₃-C₇ cycloalkyl, aryl or C₁-C₆
alkylene-aryl substituted with 0-3 substituents independently selected from -F, -Cl,
-NO₂, -R³, -OR³, -SiR³₃

R⁹ is =O, -F, -Cl, -Br, -I, -CN, -NO₂, -OR⁶, -NR⁶₂, -NR⁶-C(O)R⁸, -NR⁶-C(O)OR⁸, -SR⁶,
-S(O)R⁶, -S(O)₂R⁶, -COOR⁶, -C(O)NR⁶₂ and -S(O)₂NR⁶₂.

Cross-link cleavage building blocks
It may be advantageous to split the transfer of a chemical entity to a recipient
reactive group into two separate steps, namely a cross-linking step and a cleavage step,
because each step can be optimized. A suitable building block for this two-step
process is illustrated below:

[Structural formula not reproduced.]

Initially, a reactive group appearing on the functional entity precursor (abbreviated
FEP) reacts with a recipient reactive group, e.g. a reactive group appearing on a
scaffold, thereby forming a cross-link. Subsequently, a cleavage is performed,
usually by adding an aqueous oxidising agent such as I₂, Br₂, Cl₂, H⁺, or a Lewis acid.
The cleavage results in a transfer of the group HZ-FEP- to the recipient moiety, such
as a scaffold.

In the above formula

Z is O, S, NR⁴
Q is N, CR¹

P is a valence bond, O, S, NR⁴, or a group C₅₋₇ arylene, C₁₋₆ alkylene,
C₁₋₆ O-alkylene, C₁₋₆ S-alkylene, NR¹-alkylene, C₁₋₆ alkylene-O, C₁₋₆ alkylene-S,
said group optionally being substituted with 0-3 R⁴, 0-3 R⁵ and 0-3 R⁹ or
C₁-C₃ alkylene-NR⁴₂, C₁-C₃ alkylene-NR⁴C(O)R⁸, C₁-C₃ alkylene-NR⁴C(O)OR⁸,
C₁-C₂ alkylene-O-NR⁴₂, C₁-C₂ alkylene-O-NR⁴C(O)R⁸, C₁-C₂ alkylene-O-NR⁴C(O)OR⁸
substituted with 0-3 R⁹,

B is a group comprising D-E-F, in which

D is a valence bond or a group C₁₋₆ alkylene, C₁₋₆ alkenylene,
C₁₋₆ alkynylene, C₅₋₇ arylene, or C₅₋₇ heteroarylene, said group optionally being
substituted with 1 to 4 group R¹¹,

E is, when present, a valence bond, O, S, NR⁴, or a group
C₁₋₆ alkylene, C₁₋₆ alkenylene, C₁₋₆ alkynylene, C₅₋₇ arylene, or C₅₋₇ heteroarylene, said
group optionally being substituted with 1 to 4 group R¹¹,

F is, when present, a valence bond, O, S, or NR⁴,

A is a spacing group distancing the chemical structure from the
complementing element, which may be a nucleic acid,

R¹, R², and R³ are independent of each other selected among the
group consisting of H, C₁-C₆ alkyl, C₂-C₆ alkenyl, C₂-C₆ alkynyl, C₄-C₈ alkadienyl,
C₃-C₇ cycloalkyl, C₃-C₇ cycloheteroalkyl, aryl, and heteroaryl, said group being
substituted with 0-3 R⁴, 0-3 R⁵ and 0-3 R⁹ or C₁-C₃ alkylene-NR⁴₂,
C₁-C₃ alkylene-NR⁴C(O)R⁸, C₁-C₃ alkylene-NR⁴C(O)OR⁸, C₁-C₂ alkylene-O-NR⁴₂,
C₁-C₂ alkylene-O-NR⁴C(O)R⁸, C₁-C₂ alkylene-O-NR⁴C(O)OR⁸ substituted with 0-3 R⁹,

FEP is a group selected among the group consisting of H, C₁-C₆ alkyl,
C₂-C₆ alkenyl, C₂-C₆ alkynyl, C₄-C₈ alkadienyl, C₃-C₇ cycloalkyl, C₃-C₇
cycloheteroalkyl, aryl, and heteroaryl, said group being substituted with 0-3 R⁴, 0-3 R⁵
and 0-3 R⁹ or C₁-C₃ alkylene-NR⁴₂, C₁-C₃ alkylene-NR⁴C(O)R⁸,
C₁-C₃ alkylene-NR⁴C(O)OR⁸, C₁-C₂ alkylene-O-NR⁴₂, C₁-C₂ alkylene-O-NR⁴C(O)R⁸,
C₁-C₂ alkylene-O-NR⁴C(O)OR⁸ substituted with 0-3 R⁹,

where R⁴ is H or selected independently among the group consisting of
C₁-C₆ alkyl, C₂-C₆ alkenyl, C₂-C₆ alkynyl, C₃-C₇ cycloalkyl, C₃-C₇ cycloheteroalkyl,
aryl, heteroaryl, said group being substituted with 0-3 R⁹ and

R⁵ is selected independently from -N₃, -CNO, -C(NOH)NH₂, -NHOH,
-NHNHR⁶, -C(O)R⁶, -SnR⁶₃, -B(OR⁶)₂, -P(O)(OR⁶)₂ or the group consisting of C₂-C₆
alkenyl, C₂-C₆ alkynyl, C₄-C₈ alkadienyl, said group being substituted with 0-2 R⁷,

where R⁶ is selected independently from H, C₁-C₆ alkyl, C₃-C₇ cycloalkyl,
aryl or C₁-C₆ alkylene-aryl substituted with 0-5 halogen atoms selected from -F,
-Cl, -Br, and -I; and R⁷ is independently selected from -NO₂, -COOR⁶, -COR⁶, -CN,
-OSiR⁶₃, -OR⁶ and -NR⁶₂.

R⁸ is H, C₁-C₆ alkyl, C₂-C₆ alkenyl, C₂-C₆ alkynyl, C₃-C₇ cycloalkyl, aryl or C₁-C₆
alkylene-aryl substituted with 0-3 substituents independently selected from -F, -Cl,
-NO₂, -R³, -OR³, -SiR³₃

R⁹ is =O, -F, -Cl, -Br, -I, -CN, -NO₂, -OR⁶, -NR⁶₂, -NR⁶-C(O)R⁸, -NR⁶-C(O)OR⁸, -SR⁶,
-S(O)R⁶, -S(O)₂R⁶, -COOR⁶, -C(O)NR⁶₂ and -S(O)₂NR⁶₂.

In a preferred embodiment Z is O or S, P is a valence bond, Q is CH, B is CH₂, and
R¹, R², and R³ are H. The bond between the carbonyl group and Z is cleavable with
aqueous I₂.

Contacting between target and library

The contacting step, by which the library of bifunctional molecules is subjected
under binding conditions to a target, may be referred to as the enrichment step or the
selection step, as appropriate, and includes the screening of the library for synthetic
molecules having predetermined desirable characteristics. Predetermined desirable
characteristics can include binding to a target, catalytically changing the target,
chemically reacting with a target in a manner which alters/modifies the target or the
functional activity of the target, and covalently attaching to the target as in a suicide
inhibitor.

In theory, molecules of interest can be selected based on their properties using
either physical or physiological procedures. The method preferred according to the
present invention is to enrich molecules with respect to binding affinity towards a
target of interest. In a certain embodiment, the basic steps involve mixing the library
of complexes with the immobilized target of interest. The target can be attached to a
column matrix or microtitre wells with direct immobilization or by means of antibody
binding or other high-affinity interactions. In another embodiment, the target and
displayed molecules interact without immobilisation of the target. Displayed
molecules that bind to the target will be retained on this surface, while nonbinding
displayed molecules will be removed during a single or a series of wash steps. The
identifiers of complexes bound to the target can then be separated by cleaving the
physical connection to the synthetic molecule. It may be advantageous
to perform a chromatography step after or instead of the washing step. After the
cleavage of the physical link between the synthetic molecule and the identifier, the
identifier may be recovered from the media and optionally amplified before the
decoding step.
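
As an illustration of the decoding step, the following Python sketch splits each
recovered identifier into codons and looks each codon up in a codebook. It is a
sketch under stated assumptions only: the fixed codon length of four nucleotides, the
codebook entries and the function names are hypothetical, and recovered identifiers
are assumed to be available as plain sequence strings (e.g. after amplification and
sequencing).

```python
from collections import Counter

CODON_LENGTH = 4  # hypothetical fixed codon length
# Hypothetical codebook: which chemical entity each codon codes for.
CODEBOOK = {"ACGT": "entity 1", "TTGA": "entity 2", "GGCA": "entity 3"}


def decode(identifier: str) -> tuple:
    """Translate an identifier into the ordered chemical entities it encodes."""
    if len(identifier) % CODON_LENGTH:
        raise ValueError("identifier length is not a whole number of codons")
    codons = [identifier[i:i + CODON_LENGTH]
              for i in range(0, len(identifier), CODON_LENGTH)]
    return tuple(CODEBOOK[codon] for codon in codons)


def rank_recovered(identifiers):
    """Count how often each encoded molecule occurs among recovered identifiers,
    most frequent first."""
    return Counter(decode(identifier) for identifier in identifiers).most_common()


# Hypothetical identifiers recovered after selection.
recovered = ["ACGTTTGA", "ACGTTTGA", "GGCAACGT"]
for entities, count in rank_recovered(recovered):
    print(count, " + ".join(entities))
```

Counting the decoded molecules rather than the raw sequences keeps the readout in
terms of the synthetic molecules themselves, which is what the selection is meant to
identify.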

A significant reduction in background binders may be obtained with increased
washing volumes, repeating washing steps, higher detergent concentrations and
prolonged incubation during washing. Thus, the more volume and number of steps
used in the washing procedure together with more stringent conditions will more
efficiently remove non-binders and background binders. The right stringency in the
washing step can also be used to remove low-affinity specific binders. However, the
washing step will also remove wanted binders if too harsh conditions are used.
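
The trade-off between background removal and loss of wanted binders can be made
concrete with a toy model. The Python sketch below assumes, purely for illustration,
that each wash retains a constant fraction of the specific binders and a smaller
constant fraction of the background binders; the retention values and function names
are hypothetical and are not measurements from the experiments reported herein.

```python
def surviving_fraction(retention_per_wash: float, washes: int) -> float:
    """Fraction of a population left after `washes` identical wash steps,
    assuming each wash retains the same fraction (a simplifying assumption)."""
    return retention_per_wash ** washes


def enrichment(binder_retention: float, background_retention: float, washes: int) -> float:
    """Ratio by which binders are enriched over background after the washes."""
    return (surviving_fraction(binder_retention, washes)
            / surviving_fraction(background_retention, washes))


# Hypothetical retentions: 90 % of binders and 10 % of background per wash.
for washes in (1, 3, 6):
    kept = surviving_fraction(0.9, washes)
    print(f"{washes} washes: binders kept {kept:.3f}, "
          f"enrichment {enrichment(0.9, 0.1, washes):.0f}")
```

Under these assumptions each additional wash multiplies the enrichment by the same
factor while also reducing the amount of wanted binder that remains, so the number
and stringency of the washes have to be balanced against the yield.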

A blocking step, such as incubation of solid phase with skimmed milk proteins or
other inert proteins and/or mild detergent such as Tween-20 and Triton X-100, may
also be used to reduce the background. The washing conditions should be as
stringent as possible to remove background binding but to retain specific binders that
interact with the immobilized target. Generally, washing conditions are adjusted to
maintain the desired affinity binders, e.g. binders in the micromolar, nanomolar, or
picomolar range.

In traditional elution protocols, false positives due to suboptimal binding and
washing conditions are difficult to circumvent and may require elaborate adjustments of
experimental conditions. However, an enrichment of more than 100 to 1000 is rarely
obtained. The present invention alleviates the problem of false positives being
obtained because the non-specific binding complexes to a large extent remain in the
reaction chamber. The experiments reported herein suggest that an enrichment of
more than 10⁷ can be obtained.
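
To show what such enrichment figures mean for a library, the following Python sketch
computes the share of a single specific complex after selection. It assumes that the
enrichment factor multiplies the ratio of the specific complex to all other
complexes; this definition and the function name are assumptions made for
illustration, not a definition given in the application.

```python
def fraction_after_enrichment(initial_fraction: float, enrichment: float) -> float:
    """Fraction of the specific complex after selection, assuming the
    enrichment factor multiplies the binder-to-non-binder ratio."""
    odds = initial_fraction / (1.0 - initial_fraction)
    enriched_odds = odds * enrichment
    return enriched_odds / (1.0 + enriched_odds)


# One specific complex in a library of 10^6 complexes.
start = 1e-6
print(fraction_after_enrichment(start, 1e3))  # enrichment typical of elution protocols
print(fraction_after_enrichment(start, 1e7))  # enrichment suggested by the experiments
```

Under this assumption an enrichment of 1000 leaves the specific complex at roughly
0.1 % of the recovered material, whereas an enrichment of 10⁷ makes it the dominant
species, at roughly 90 %.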

The target can be any compound of interest. E.g. the target can be a protein,
peptide, carbohydrate, polysaccharide, glycoprotein, hormone, receptor, antigen,
antibody, virus, substrate, metabolite, transition state analogue, cofactor, inhibitor, drug,
dye, nutrient, growth factor, cell, tissue, etc. without limitation. Suitable targets
include, but are not limited to, angiotensin converting enzyme, renin, cyclooxygenase,
5-lipoxygenase, IL-1β converting enzyme, cytokine receptors, PDGF receptor,
type II inosine monophosphate dehydrogenase, β-lactamases, integrin, proteases
like factor VIIa, kinases like Bcr-Abl/Her, phosphatases like PTP-1B, and fungal
cytochrome P-450. Targets can include, but are not limited to, bradykinin, neutrophil
elastase, the HIV proteins, including tat, rev, gag, int, RT, nucleocapsid etc., VEGF,
bFGF, TGFβ, KGF, PDGF, GPCR, thrombin, substance P, IgE, sPLA2, red blood
cells, glioblastomas, fibrin clots, PBMCs, hCG, lectins, selectins, cytokines, ICP4,
complement proteins, etc.

A target can also be a surface of a non-biological origin, such as a polymer surface
or a metal surface. The method of the invention may then be used to identify
suitable coatings for such surfaces.

In a preferred embodiment, the desirable synthetic molecule acts on the target
without any interaction between the nucleic acid attached to the desirable encoded
molecule and the target. In one embodiment, the bound complex-target aggregate
can be partitioned from unbound complexes by a number of methods. The methods
include nitrocellulose filter binding, column chromatography, filtration, affinity
chromatography, centrifugation, and other well known methods. A preferred method is
size-exclusion chromatography.

Briefly, the library of complexes is subjected to the target, which may include contact
between the library and a column onto which the target is immobilised. Identifiers
associated with undesirable synthetic molecules, i.e. synthetic molecules not bound
to the target under the stringency conditions used, will pass through the column.
Additional undesirable synthetic molecules (e.g. synthetic molecules which
cross-react with other targets) may be removed by counter-selection methods. Desirable
complexes are bound to the column. The target may be immobilized in a number of
ways. In one embodiment, the target is immobilized through a cleavable physical
link, such as one or more chemical bonds. The aggregate of the target and the
complex may then be subjected to a size exclusion chromatography to separate the
aggregate from the rest of the compounds in the media. The complex may then be
eluted from the target by changing the conditions (e.g., salt, pH, surfactant,
temperature etc.). Alternatively, the complex may be provided with a cleavable linker,
preferably orthogonal to the cleavable linker that attaches the target to the solid
support, at a position between the synthetic molecule and the identifier. Subsequent
to the size exclusion chromatography this cleavable linker is cleaved to separate the
identifiers of complexes having affinity towards the targets. To mention just a single
type of orthogonal cleavable linkage, one could attach the target to the solid
support through a linkage that can be cleaved by a chemical agent, and the linker
separating the synthetic molecule and the identifier may be selected as a photocleavable
linkage. More specifically, the former linkage may be a disulphide bond that can be
cleaved by a suitable reducing agent like DTT (dithiothreitol) and the latter linkage
may be an o-nitrophenyl group.

There are other partitioning and screening processes which are compatible with this
invention that are known to one of ordinary skill in the art. In one embodiment, the
products can be fractionated by a number of common methods and each
fraction is then assayed for activity. The fractionation methods can include size, pH,
hydrophobicity, etc.

Inherent in the present method is the selection of encoded molecules on the basis of
a desired function; this can be extended to the selection of molecules with a desired
function and specificity. Specificity can be required during the selection process by
first extracting complexes which are capable of interacting with a non-desired
"target" (negative selection, or counter-selection), followed by positive selection with the
desired target. As an example, inhibitors of fungal cytochrome P-450 are known to
cross-react to some extent with mammalian cytochrome P-450 (resulting in serious
side effects). Highly specific inhibitors of the fungal cytochrome could be selected
from a library by first removing those complexes capable of interacting with the
mammalian cytochrome, followed by retention of the remaining products which are
capable of interacting with the fungal cytochrome.

Cleavable linkers

A cleavable linker may be positioned between the target and a solid support or
between the potential drug candidate and the identifier or any other position that may
ensure a separation of the identifiers of successful complexes from non-specific
binding complexes. The cleavable linker may be selectively cleavable, i.e. conditions
may be selected that only cleave that particular linker.

The cleavable linkers may be selected from a large plethora of chemical structures.
Examples of linkers include, but are not limited to, linkers having an enzymatic
cleavage site, linkers comprising a chemically degradable component, and linkers
cleavable by electromagnetic radiation.

Examples of linkers cleavable by electromagnetic radiation (light)

o-nitrobenzyl p-alkoxy
[Structural formula not reproduced.]

o-nitrobenzyl in exo position
[Structural formula not reproduced.]

For more details see Holmes CP. J. Org. Chem. 1997, 62, 2370-2380

3-nitrophenyloxy
[Structural formula not reproduced.]

For more details see Rajasekharan Pillai, V. N. Synthesis. 1980, 1-26

Dansyl derivatives:
[Structural formula not reproduced.]

For more details see Rajasekharan Pillai, V. N. Synthesis. 1980, 1-26

Coumarin derivatives
[Reaction scheme not reproduced: irradiation (hv) in the presence of an H-donor
releases H-NR²R³.]

For more details see R. O. Schoenleber, B. Giese. Synlett 2003, 501-504

R¹ and R² can be either of the potential drug candidate and the identifier,
respectively. Alternatively, R¹ and R² can be either of the target or a solid support,
respectively.

R³ = H or OCH₃
If X is O then the product will be a carboxylic acid
If X is NH the product will be a carboxamide

One specific example is the PC Spacer Phosphoramidite (Glen Research catalog #
10-4913-90) which can be introduced in an oligonucleotide during synthesis and
cleaved by subjecting the sample in water to UV light (~ 300-350 nm) for 30 seconds
to 1 minute.

[Structural formula of the PC Spacer Phosphoramidite not reproduced.]

DMT = 4,4'-Dimethoxytrityl
iPr = Isopropyl
CNEt = Cyanoethyl

The above PC spacer phosphoramidite is suitably incorporated in a library of
complexes at a position between the identifier and the potential drug candidate. The
spacer may be cleaved according to the following reaction.



R¹ and R² can be either of the encoded molecule and the identifying molecule,
respectively. In a preferred aspect R² is an oligonucleotide identifier and R¹ is the
potential drug candidate. When the linker is cleaved a phosphate group is generated,
allowing for further biological reactions. As an example, the phosphate group may
be positioned at the 5' end of an oligonucleotide, allowing for an enzymatic ligation
process to take place.

Examples of linkers cleavable by chemical agents: 




Ester linkers can be cleaved by nucleophilic attack using e.g. hydroxide ions. In
practice this can be accomplished by subjecting the target-ligand complex to a base
for a short period.

[Reaction scheme not reproduced; recoverable labels: R 3 , R 4 , R 5 , R 6 , X, OH-.]

One of R 1 and R 2 is the potential drug candidate and the other is the identifier.
R 4-6 can be any of the following: H, CN, F, NO 2 , SO 2 NR 2 .

Disulfide linkers can efficiently be cleaved (reduced) by tris(2-carboxyethyl)phosphine
(TCEP). TCEP selectively and completely reduces even the most stable water-soluble
alkyl disulfides over a wide pH range. These reductions frequently require less than
5 minutes at room temperature. TCEP is a non-volatile and odorless reductant and,
unlike most other reducing agents, it is resistant to air oxidation.
Trialkylphosphines such as TCEP are stable in aqueous solution, selectively reduce
disulfide bonds, and are essentially unreactive toward other functional groups
commonly found in proteins.

[Reaction scheme not reproduced: R 1 -S-S-R 2 is reduced by TCEP in H 2 O to give
R 1 -SH + HS-R 2 and the phosphine oxide of TCEP.]


More details on the reduction of disulfide bonds can be found in Kirley, T.L. (1989),
Reduction and fluorescent labeling of cyst(e)ine-containing proteins for subsequent
structural analysis, Anal. Biochem. 180, 231 and Levison, M.E., et al. (1969),
Reduction of biological substances by water-soluble phosphines: Gamma-globulin.
Experientia 25, 126-127.



Linkers cleavable by enzymes

The linker connecting the potential drug candidate with the identifier, or the solid
support with the target, can include a peptide region that allows a specific cleavage
using a protease. This is a well-known strategy in molecular biology. Site-specific
proteases and their cognate target amino acid sequences are often used to remove
the fusion protein tags that facilitate enhanced expression, solubility, secretion or
purification of the fusion protein.

Various proteases can be used to accomplish a specific cleavage. The specificity is
especially important when the cleavage site is presented together with other
sequences, such as, for example, fusion proteins. Various conditions have been
optimized in order to enhance the cleavage efficiency and control the specificity.
These conditions are available and known in the art.



Enterokinase is one example of an enzyme (a serine protease) that cuts a specific
amino acid sequence. The enterokinase recognition site is Asp-Asp-Asp-Asp-Lys
(DDDDK), and it cleaves C-terminally of Lys. Purified recombinant enterokinase is
commercially available and is highly active over wide ranges of pH (pH 4.5-9.5) and
temperature (4-45°C).

The nuclear inclusion protease from tobacco etch virus (TEV) is another commercially
available and well-characterized protease that can be used to cut at a specific
amino acid sequence. TEV protease cleaves the sequence Glu-Asn-Leu-Tyr-Phe-Gln-Gly/Ser
(ENLYFQG/S) between Gln-Gly or Gln-Ser with high specificity.

Another well-known protease is thrombin, which specifically cleaves the sequence
Leu-Val-Pro-Arg-Gly-Ser (LVPRGS) between Arg-Gly. Thrombin has also been used for
cleavage of recombinant fusion proteins. Other sequences can also be used for
thrombin cleavage; these sequences are more or less specific and more or less
efficiently cleaved by thrombin. Thrombin is a highly active protease and various
reaction conditions are known to the public.

Activated coagulation factor FX (FXa) is also known to be a specific and useful
protease. This enzyme cleaves C-terminally of Arg at the sequence Ile-Glu-Gly-Arg
(IEGR). FXa is frequently used to cut between fusion proteins when producing
proteins with recombinant technology. Other recognition sequences can also be used
for FXa.
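
As an illustration of how the recognition sequences named above could be located in a
peptide linker, the following minimal Python sketch scans a one-letter amino acid
sequence for the four motifs. The function name, the table layout and the example
linker are hypothetical and are not part of the described method; the cut offsets
simply follow the positions stated in the text (after Lys for enterokinase, between
Gln and Gly/Ser for TEV, between Arg and Gly for thrombin, after Arg for FXa).

```python
import re

# Motif (as a regular expression) and the offset inside the motif after which
# the peptide bond is cut. Motifs come from the text; the layout is illustrative.
PROTEASE_SITES = {
    "enterokinase": (r"DDDDK", 5),   # cuts C-terminally of Lys
    "TEV": (r"ENLYFQ[GS]", 6),       # cuts between Gln and Gly/Ser
    "thrombin": (r"LVPRGS", 4),      # cuts between Arg and Gly
    "FXa": (r"IEGR", 4),             # cuts C-terminally of Arg
}


def find_cleavage_sites(peptide):
    """Return {protease: [cut positions]}.

    A cut position p means the peptide is split into peptide[:p] and peptide[p:].
    """
    hits = {}
    for name, (motif, offset) in PROTEASE_SITES.items():
        # A lookahead is used so that overlapping occurrences are all reported.
        positions = [m.start() + offset
                     for m in re.finditer(f"(?={motif})", peptide)]
        if positions:
            hits[name] = positions
    return hits


linker = "GGSENLYFQGGGSDDDDKAA"  # hypothetical linker peptide
sites = find_cleavage_sites(linker)
```

In this hypothetical linker only the TEV and enterokinase motifs occur, so a protease
can be chosen that cuts the linker at exactly one position.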

Other types of proteolytic enzymes that recognize specific amino acid sequences can
also be used. In addition, proteolytic enzymes that cleave amino acid sequences
in an unspecific manner can also be used if only the linker contains an amino acid
sequence in the complex molecule.

Other types of molecules, such as ribozymes, catalytically active antibodies, or
lipases, can also be used. The only prerequisite is that the catalytically active
molecule can cleave the specific structure used as the linker, or as a part of the
linker, that connects the encoding region and the displayed molecule or, in the
alternative, the solid support and the target.

A variety of endonucleases are available that recognize and cleave a double stranded
nucleic acid having a specific sequence of nucleotides. The endonuclease EcoRI is an
example of a nuclease that efficiently cuts a nucleotide sequence linker comprising
the sequence GAATTC, also when this sequence is close to the end of the nucleotide
sequence. Purified recombinant EcoRI is commercially available and is highly active
in a range of buffer conditions. As an example, EcoRI works in various buffers as
indicated below (NEBuffer is available from New England Biolabs):

NEBuffer 1: [10 mM Bis Tris Propane-HCl, 10 mM MgCl 2 , 1 mM dithiothreitol (pH
7.0 at 25°C)],

NEBuffer 2: [50 mM NaCl, 10 mM Tris-HCl, 10 mM MgCl 2 , 1 mM dithiothreitol (pH
7.9 at 25°C)],

NEBuffer 3: [100 mM NaCl, 50 mM Tris-HCl, 10 mM MgCl 2 , 1 mM dithiothreitol (pH
7.9 at 25°C)],

NEBuffer 4: [50 mM potassium acetate, 20 mM Tris-acetate, 10 mM magnesium
acetate, 1 mM dithiothreitol (pH 7.9 at 25°C)].

Extension buffer: mM KCl, 20 mM Tris-HCl (pH 8.8 at 25°C), 10 mM (NH 4 ) 2 SO 4 , 2
mM MgSO 4 , 0.1% Triton X-100, and 200 µM dNTPs.
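
The EcoRI cleavage of a nucleotide linker described above can be illustrated with a
short Python sketch. It assumes the generally known cut position of EcoRI (after the
first G of GAATTC) and models only the top strand; the function name and the example
linker sequence are hypothetical.

```python
ECORI_SITE = "GAATTC"
ECORI_CUT_OFFSET = 1  # EcoRI cuts after the first G of the site (G^AATTC)


def ecori_fragments(top_strand):
    """Split the top strand of a double stranded linker at every EcoRI site."""
    seq = top_strand.upper()
    fragments = []
    start = 0
    pos = seq.find(ECORI_SITE)
    while pos != -1:
        cut = pos + ECORI_CUT_OFFSET
        fragments.append(seq[start:cut])
        start = cut
        pos = seq.find(ECORI_SITE, pos + 1)
    fragments.append(seq[start:])  # remainder after the last cut
    return fragments


linker = "ACGTGAATTCTTGA"  # hypothetical linker containing one EcoRI site
fragments = ecori_fragments(linker)
```

A linker without the site is returned as a single fragment, which is a simple way to
check that a given identifier sequence is not cut unintentionally.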



Determining the identifier sequence

The nucleotide sequence of the identifier sequence present in the isolated
bifunctional molecules or the separated identifiers is determined to identify the
chemical entities that participated in the binding interaction. The synthesis method
of the synthetic molecule may be established if information on the chemical entities,
as well as the point in time at which they were incorporated in the synthetic
molecule, can be deduced from the identifier. It may be sufficient to get information
on the chemical structure of the various chemical entities that have participated in
the synthetic molecule to deduce the full molecule, due to structural constraints
during the formation. As an example, the use of different kinds of attachment
chemistries may ensure that a chemical entity on a building block can only be
transferred to a single position on a scaffold. Another kind of chemical constraint
may be present due to steric hindrance on the scaffold molecule or the chemical
entity to be transferred. In general, however, it is preferred that information can
be inferred from the identifier sequence that enables the identification of each of
the chemical entities that have participated in the formation of the encoded
molecule, along with the point in time in the synthesis history at which the chemical
entities were incorporated in the (nascent) synthetic molecule.
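
The decoding idea in the preceding paragraph can be sketched as follows. This is only
an illustration under stated assumptions: it supposes that the coding region of the
identifier consists of fixed-length codons, that each codon stands for one chemical
entity, and that the position of a codon reflects the synthesis step. The codon
table, the codon length and the names below are all hypothetical.

```python
# Hypothetical codon table: each 4-nucleotide codon stands for one chemical entity.
CODON_TABLE = {
    "ACGT": "entity A",
    "TTGA": "entity B",
    "GGCA": "entity C",
}
CODON_LENGTH = 4  # assumed fixed codon length


def decode_identifier(coding_region):
    """Return [(synthesis step, chemical entity)] in codon order."""
    if len(coding_region) % CODON_LENGTH:
        raise ValueError("coding region is not a whole number of codons")
    history = []
    starts = range(0, len(coding_region), CODON_LENGTH)
    for step, start in enumerate(starts, 1):
        codon = coding_region[start:start + CODON_LENGTH]
        history.append((step, CODON_TABLE[codon]))  # KeyError for unknown codons
    return history


history = decode_identifier("TTGAACGTGGCA")
```

The returned list gives both the identity of each chemical entity and the step at
which it was incorporated, which is the information the text prefers to recover.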

Although conventional DNA sequencing methods are readily available and useful for
this determination, the amount and quality of isolated bifunctional molecule may
require additional manipulations prior to a sequencing reaction.

Where the amount is low, it is preferred to increase the amount of the identifier
sequence by polymerase chain reaction (PCR) using PCR primers directed to primer
binding sites present in the identifier sequence.

In addition, the quality of the isolated bifunctional molecule may be such that
multiple species of bifunctional molecule are co-isolated by virtue of similar
capacities for binding to the target. In cases where more than one species of
bifunctional molecule is isolated, the different isolated species must be separated
prior to sequencing of the identifier oligonucleotide.

Thus in one embodiment, the different identifier sequences of the isolated
bifunctional complexes are cloned into separate sequencing vectors prior to
determining their sequence by DNA sequencing methods. This is typically accomplished
by amplifying all of the different identifier sequences by PCR as described herein,
and then using unique restriction endonuclease sites on the amplified product to
directionally clone the amplified fragments into sequencing vectors. The cloning and
sequencing of the amplified fragments is then a routine procedure that can be carried
out by any of a number of molecular biological methods known in the art.

Alternatively, the bifunctional complex or the PCR amplified identifier sequence can
be analysed in a microarray. The array may be designed to analyse the presence of
a single codon or multiple codons in an identifier sequence.

Synthesis of nucleic acids 

Oligonucleotides can be synthesized by a variety of chemistries, as is well known.
For synthesis of an oligonucleotide on a substrate in the direction of 3' to 5', a free
hydroxy terminus is required that can be conveniently blocked and deblocked as
needed. A preferred hydroxy terminus blocking group is a dimethoxytrityl ether
(DMT). DMT blocked termini are first deblocked, such as by treatment with 3%
dichloroacetic acid (DCA) in dichloromethane (DCM), as is well known for
oligonucleotide synthesis, to form a free hydroxy terminus.

Nucleotides in precursor form for addition to a free hydroxy terminus in the direction
of 3' to 5' require a phosphoramidate moiety having an aminodiisopropyl side chain
at the 3' terminus of a nucleotide. In addition, the free hydroxy of the
phosphoramidate is blocked with a cyanoethyl ester (OCNET), and the 5' terminus is
blocked with a DMT ether. The addition of a 5' DMT-, 3' OCNET-blocked phosphoramidate
nucleotide to a free hydroxyl requires tetrazole in acetonitrile followed by iodine
oxidation and capping of unreacted hydroxyls with acetic anhydride, as is well known
for oligonucleotide synthesis. The resulting product contains an added nucleotide
residue with a DMT blocked 5' terminus, ready for deblocking and addition of a
subsequent blocked nucleotide as before.

For synthesis of an oligonucleotide in the direction of 5' to 3', a free hydroxy
terminus on the linker is required as before. However, the blocked nucleotide to be
added has the blocking chemistries reversed on its 5' and 3' termini to facilitate
addition in the opposite orientation. A nucleotide with a free 3' hydroxyl and 5' DMT
ether is first blocked at the 3' hydroxy terminus by reaction with TBS-Cl in imidazole
to form a TBS ester at the 3' terminus. Then the DMT-blocked 5' terminus is deblocked
with DCA in DCM as before to form a free 5' hydroxy terminus. The reagent
(N,N-diisopropylamino)(cyanoethyl) phosphonamidic chloride having an aminodiisopropyl
group and an OCNET ester is reacted in tetrahydrofuran (THF) with the 5' deblocked
nucleotide to form the aminodiisopropyl-, OCNET-blocked phosphonamidate group on the
5' terminus. Thereafter the 3' TBS ester is removed with tetrabutylammonium fluoride
(TBAF) in DCM to form a nucleotide with the phosphonamidate-blocked 5' terminus and
a free 3' hydroxy terminus. Reaction in base with DMT-Cl adds a DMT ether blocking
group to the 3' hydroxy terminus.

The addition of the 3' DMT-, 5' OCNET-blocked phosphonamidated nucleotide to a
linker substrate having a free hydroxy terminus then proceeds using the previous
tetrazole reaction, as is well known for oligonucleotide polymerization. The resulting
product contains an added nucleotide residue with a DMT-blocked 3' terminus,
ready for deblocking with DCA in DCM and the addition of a subsequent blocked
nucleotide as before.



Extension and amplification 

The use of the polymerase chain reaction (PCR) is a preferred embodiment for the
production of the identifiers, using the nucleic acids of the selected complexes as
templates.

For use in this invention, the identifier sequences are preferably comprised of
polynucleotide coding strands, such as mRNA and/or the sense strand of genomic DNA,
or non-natural nucleic acids, like TNA and LNA, which may be used as template for a
polymerase. If the genetic material to be processed is in the form of double stranded
nucleic acid, it is usually first denatured, typically by melting, into single strands.
The nucleic acid is subjected to a PCR reaction by treating (contacting) the sample
with a PCR primer pair, each member of the pair having a preselected nucleotide
sequence. The PCR primer pair is capable of initiating primer extension reactions by
hybridizing to the PCR primer binding site on the identifier oligonucleotide,
preferably at least about 10 nucleotides in length, more preferably at least about 12
nucleotides in length. The first primer of a PCR primer pair is sometimes referred to
as the "anti-sense primer" because it is extended into a non-coding or anti-sense
strand of a nucleic acid, i.e., a strand complementary to a coding strand. The second
primer of a PCR primer pair is sometimes referred to as the "sense primer" because it
is adjoined with the coding or sense strand of a nucleic acid.

The PCR reaction is performed by mixing the PCR primer pair, preferably a
predetermined amount thereof, with the nucleic acids of the sample, preferably a
predetermined amount thereof, in a PCR buffer to form a PCR reaction admixture. The
admixture is thermocycled for a number of cycles, which is typically predetermined,
sufficient for the formation of a PCR reaction product, thereby amplifying the
identifiers in the isolated complex.

PCR is typically carried out by thermocycling, i.e., repeatedly increasing and
decreasing the temperature of a PCR reaction admixture within a temperature range
whose lower limit is about 30 degrees Celsius (30° C.) to about 55° C. and whose
upper limit is about 90° C. to about 100° C. The increasing and decreasing can be
continuous, but is preferably phasic, with time periods of relative temperature
stability at each of the temperatures favoring polynucleotide synthesis, denaturation
and hybridization.
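
To give a feeling for the number of cycles needed, the following sketch uses the
common idealized model in which each thermocycle multiplies the copy number by
(1 + efficiency). The model, the function name and the efficiency value of 0.8 are
illustrative assumptions and are not taken from the experiments described herein.

```python
def copies_after_pcr(initial_copies, cycles, efficiency=1.0):
    """Idealized exponential PCR model.

    Each cycle multiplies the copy number by (1 + efficiency), where an
    efficiency of 1.0 corresponds to perfect doubling.
    """
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    return initial_copies * (1.0 + efficiency) ** cycles


ideal = copies_after_pcr(1, 30)                       # perfect doubling
sub_optimal = copies_after_pcr(1, 30, efficiency=0.8)  # assumed 80 % efficiency
```

Under perfect doubling, 30 cycles correspond to a factor of 2 to the power 30, i.e.
slightly more than one billion copies per starting molecule; real reactions level off
earlier because primers and nucleotides are consumed.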



A plurality of first primers and/or a plurality of second primers can be used in each
amplification, e.g., one species of first primer can be paired with a number of
different second primers to form several different primer pairs. Alternatively, an
individual pair of first and second primers can be used. In any case, the
amplification products of amplifications using the same or different combinations of
first and second primers can be combined for assaying for mutations.

The PCR reaction is performed using any suitable method. Generally it occurs in a
buffered aqueous solution, i.e., a PCR buffer, preferably at a pH of 7-9, most
preferably about 8. Preferably, a molar excess of the primer is admixed to the buffer
containing the identifier strand. A large molar excess is preferred to improve the
efficiency of the process.



The PCR buffer also contains the deoxyribonucleotide triphosphates (polynucleotide
synthesis substrates) dATP, dCTP, dGTP, and dTTP and a polymerase, typically
thermostable, all in adequate amounts for the primer extension (polynucleotide
synthesis) reaction. The resulting solution (PCR admixture) is heated to about
90° C.-100° C. for about 1 to 10 minutes, preferably from 1 to 4 minutes. After this
heating period the solution is allowed to cool to a primer hybridization temperature.
The synthesis reaction may occur at from room temperature up to a temperature above
which the polymerase (inducing agent) no longer functions efficiently. Thus, for
example, if DNA polymerase is used as inducing agent, the temperature is generally no
greater than about 40° C. The thermocycling is repeated until the desired amount of
PCR product is produced. An exemplary PCR buffer comprises the following: 50 mM KCl;
10 mM Tris-HCl at pH 8.3; 1.5 mM MgCl 2 ; 0.001% (wt/vol) gelatin; 200 µM dATP;
200 µM dTTP; 200 µM dCTP; 200 µM dGTP; and 2.5 units Thermus aquaticus (Taq)
DNA polymerase I (U.S. Pat. No. 4,889,818) per 100 microliters (µl) of buffer.
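
As a small worked example for the exemplary PCR buffer, the following sketch converts
the stated final concentrations into absolute amounts for a given reaction volume,
using the identity that micromolar times microliters gives picomoles. The dictionary
and function names are hypothetical; gelatin and the polymerase are left out because
they are not specified in molar units.

```python
# Final concentrations of the exemplary PCR buffer, in micromolar (from the text).
BUFFER_UM = {
    "KCl": 50_000,       # 50 mM
    "Tris-HCl": 10_000,  # 10 mM
    "MgCl2": 1_500,      # 1.5 mM
    "dATP": 200,
    "dTTP": 200,
    "dCTP": 200,
    "dGTP": 200,
}


def amounts_pmol(volume_ul):
    """Amount of each component in picomoles (micromolar x microliters)."""
    return {name: conc_um * volume_ul for name, conc_um in BUFFER_UM.items()}


per_reaction = amounts_pmol(100)  # the 100 microliter volume named in the text
```

For 100 µl this gives 20 nmol of each dNTP, which is a large excess over the picomole
amounts of primer and identifier used in the examples.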

The inducing agent may be any compound or system which will function to accomplish
the synthesis of primer extension products, including enzymes. Suitable enzymes for
this purpose include, for example, E. coli DNA polymerase I, Klenow fragment of
E. coli DNA polymerase I, T4 DNA polymerase, Taq DNA polymerase, Pfu polymerase,
Vent polymerase, HIV-1 Reverse Transcriptase, other available DNA polymerases,
reverse transcriptase, and other enzymes, including heat-stable enzymes, which will
facilitate combination of the nucleotides in the proper manner to form the primer
extension products which are complementary to each nucleic acid strand. Generally,
the synthesis will be initiated at the 3' end of each primer and proceed in the 5'
direction along the identifier strand, until synthesis terminates, producing
molecules of different lengths. There may be inducing agents, however, which initiate
synthesis at the 5' end and proceed in the above direction, using the same process
as described above.

The inducing agent also may be a compound or system which will function to
accomplish the synthesis of RNA primer extension products, including enzymes. In
preferred embodiments, the inducing agent may be a DNA-dependent RNA polymerase
such as T7 RNA polymerase, T3 RNA polymerase or SP6 RNA polymerase. These
polymerases produce a complementary RNA polynucleotide. The high turnover rate of
the RNA polymerase amplifies the starting polynucleotide, as has been described by
Chamberlin et al., The Enzymes, ed. P. Boyer, pp. 87-108, Academic Press, New York
(1982). Amplification systems based on transcription have been described by
Gingeras et al., in PCR Protocols, A Guide to Methods and Applications, pp. 245-252,
Innis et al., eds, Academic Press, Inc., San Diego, Calif. (1990).

If the inducing agent is a DNA-dependent RNA polymerase and therefore incorporates
ribonucleotide triphosphates, sufficient amounts of ATP, CTP, GTP and UTP are
admixed to the primer extension reaction admixture and the resulting solution is
treated as described above.

The newly synthesized strand and its complementary nucleic acid strand form a
double-stranded molecule which can be used in the succeeding steps of the method.
PCR amplification methods are described in detail in U.S. Pat. Nos. 4,683,192,
4,683,202, 4,800,159, and 4,965,188, and at least in several texts including PCR
Technology: Principles and Applications for DNA Amplification, H. Erlich, ed.,
Stockton Press, New York (1989); and PCR Protocols: A Guide to Methods and
Applications, Innis et al., eds., Academic Press, San Diego, Calif. (1990).

The term "primer" as used herein refers to a polynucleotide, whether purified from a
nucleic acid restriction digest or produced synthetically, which is capable of acting
as a point of initiation of nucleic acid synthesis when placed under conditions in
which synthesis of a primer extension product which is complementary to a nucleic
acid strand is induced, i.e., in the presence of nucleotides and an agent for
polymerization such as DNA polymerase, reverse transcriptase and the like, and at a
suitable temperature and pH. The primer is preferably single stranded for maximum
efficiency, but may alternatively be in double stranded form. If double stranded, the
primer is first treated to separate it from its complementary strand before being used
to prepare extension products. Preferably, the primer is a polydeoxyribonucleotide.
The primer must be sufficiently long to prime the synthesis of extension products in
the presence of the agents for polymerization. The exact lengths of the primers will
depend on many factors, including temperature and the source of primer. For example,
depending on the complexity of the target sequence, a polynucleotide primer
typically contains 10 to 25 or more nucleotides, although it can contain fewer
nucleotides. Short primer molecules generally require cooler temperatures to form
sufficiently stable hybrid complexes with the identifier.
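
The relation between primer length and hybridization temperature can be illustrated
with the widely used Wallace rule of thumb for short oligonucleotides (about 2 °C per
A or T and 4 °C per G or C). The rule is an approximation that is not part of the
present description, and the function name and the truncated 10-mer are hypothetical;
the 18-mer is the primer F1 used in Example 1.

```python
def wallace_tm(primer):
    """Rule-of-thumb melting temperature in degrees Celsius for a short primer:
    2 degrees per A/T plus 4 degrees per G/C."""
    seq = primer.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if at + gc != len(seq):
        raise ValueError("primer contains characters other than A, C, G, T")
    return 2 * at + 4 * gc


full_primer = "CGGCACTCGGTCTATCTT"  # 18-mer, primer F1 of Example 1
short_primer = full_primer[:10]     # hypothetical truncated 10-mer

tm_full = wallace_tm(full_primer)
tm_short = wallace_tm(short_primer)
```

The estimate for the truncated primer is markedly lower than for the full 18-mer,
which is the reason shorter primers call for a cooler hybridization temperature.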

The primers used herein are selected to be "substantially" complementary to the
different strands of each specific sequence to be synthesized or amplified. This
means that the primer must be sufficiently complementary to non-randomly hybridize
with its respective identifier strand. Therefore, the primer sequence may or may
not reflect the exact sequence of the identifier. For example, a non-complementary
nucleic acid can be attached to the 5' end of the primer, with the remainder of the
primer sequence being substantially complementary to the strand. Such
non-complementary fragments typically code for an endonuclease restriction site or
are used as a linker to connect to a label, such as biotin.

Primers of the present invention may also contain a DNA-dependent RNA polymerase
promoter sequence or its complement. See, for example, Krieg et al., Nucl. Acids
Res., 12:7057-70 (1984); Studier et al., J. Mol. Biol., 189:113-130 (1986); and
Molecular Cloning: A Laboratory Manual, Second Edition, Maniatis et al., eds., Cold
Spring Harbor, N.Y. (1989).



When a primer containing a DNA-dependent RNA polymerase promoter is used, the
primer is hybridized to the polynucleotide strand to be amplified and the second
polynucleotide strand of the DNA-dependent RNA polymerase promoter is completed
using an inducing agent such as E. coli DNA polymerase I, or the Klenow fragment of
E. coli DNA polymerase. The starting polynucleotide is amplified by alternating
between the production of an RNA polynucleotide and a DNA polynucleotide. This may
be used for selective degradation of the RNA strand, which is prone to
disintegration upon treatment with a strong base.

Primers may also contain an identifier sequence or replication initiation site for an
RNA-directed RNA polymerase. Typical RNA-directed RNA polymerases include the
Qβ replicase described by Lizardi et al., Biotechnology, 6:1197-1202 (1988).
RNA-directed polymerases produce large numbers of RNA strands from a small number
of identifier RNA strands that contain an identifier sequence or replication
initiation site. These polymerases typically give a one million-fold amplification of
the identifier strand, as has been described by Kramer et al., J. Mol. Biol.,
89:719-736 (1974).

In one embodiment, the present invention utilizes a set of polynucleotides that form
primers having a priming region located at the 3'-terminus of the primer. The
3'-terminal priming portion of each primer is capable of acting as a primer to
catalyze nucleic acid synthesis, i.e., initiate a primer extension reaction off its
3' terminus. One or both of the primers can additionally contain a 5'-terminal
non-priming portion, i.e., a region that does not participate in hybridization to the
preferred identifier. The 5'-part of the primer may be labelled as described herein
above.

Brief Description of the Figures 

Figs. 1A and 1B show pictures of gels resulting from example 1.
Fig. 2 discloses a picture of a gel resulting from example 2.
Fig. 3 depicts a picture of a gel which shows the results of example 3.
Fig. 4 discloses a schematic representation of two embodiments of the present
invention.

Detailed Description of the drawings

Fig. 4 discloses details of the present invention. Initially, a library of complexes
(frame 1) is provided. The potential drug candidate is represented by a circle and
the identifier is indicated by a rectangle. The straight line between the identifier
and the potential drug candidate illustrates a linkage which in some embodiments is
cleavable and in other embodiments is durable under the conditions used. The
potential drug candidate and the identifier are filled with a suitable pattern to
illustrate that the individual complexes are different from one another.

In process step A, the library is subjected to a target in solution (frame 2). The part
of the library which has affinity towards the target will bind to the target. The
symbol β indicates a selectively cleavable linker, e.g. a PC spacer, cleavable by
exposure to light of a certain wavelength. Following process step I, the target may
be immobilized to a solid support, indicated by the container wall, utilizing a
linkage α (frame 3). The linkage α may be a selectively cleavable or a durable
linker. Preferably, in process step C, the non-binding part of the library is removed
and the aggregate of the complex and the target is recovered (frame 4). The recovery
of the aggregate of target and complex when in solution may be performed by
size-exclusion chromatography.

After the recovery of the aggregate of complex and target, the target may be
immobilized following process path I (frame 5). Preferably, using process step E, the
linkage β is cleaved, thus separating the potential drug candidate from the
identifier (frame 6). After the separation, the identifier is isolated to decode the
identity of the potential drug candidate. Prior to the isolation, the target bound to
the potential drug candidate may be immobilized to facilitate the isolation.
Optionally, the isolated identifier (frame 8) is amplified by e.g. PCR prior to the
identification step.

Following the route starting with process step B, the library of complexes is
subjected to an immobilized target. Complexes having affinity towards the target will
bind thereto (frame 3). In effect, the complex is immobilized. The non-binding
complexes are washed away in process step D and the immobilized
complex-target-aggregate is isolated (frame 5). In the event the linkage α is
selectively cleavable, the process step I may be followed to form the isolated
aggregate in frame 4. The identifier may be cleaved off and then decoded to establish
the identity of the potential drug candidate. When the linkages α and β are both
selectively cleavable, they are preferably orthogonally cleavable, i.e. the
conditions which cleave the linkage α do not cleave the linkage β and vice versa.

In the event the linker α is durable, i.e. not cleaved under the conditions used, the
process step F is followed. The immobilized target is maintained connected to the
solid support while the identifier is cleaved from the potential drug candidate
(frame 7). The identifier is subsequently isolated using process step H (frame 8).
Prior to decoding, the identifier may be amplified by a suitable process to obtain an
appropriate amount of genetic material.

Examples 

Example 1: Enrichment of biotin tagged DNA identifier.

100 pmol of DNA identifier T1 of the sequence
(GAGCGGATGCGTACATCTTGTACATGTCAATGCGATCGATCGACTGATCAAGATAGACCGAGTGCCG) was annealed
to 50 pmol of a biotin tagged primer F1-PC-B (Biotin-PC-CGGCACTCGGTCTATCTT). PC is a
photocleavable group and is obtainable as a phosphoramidite from Glen Research
(Products cat# 10-4913). Subsequently the F1-PC-B primer was extended by DNA
polymerase (Sequenase from Upstate Biotechnology Cat# 70775Y). Likewise, 50 pmol of a
DNA identifier T2 (GAGCGGATGCGTACATCTACGATGGATGCTCCAGGTCGCAAGATAGACCGAGTGCCG) was
annealed to 100 pmol of a primer F1 (CGGCACTCGGTCTATCTT) and extended by DNA
polymerase. This generated two pools of double stranded DNA sequences, each
containing 50 pmol of the following complexes: one of length 67 nucleotides (termed
T1+B) and one of length 57 nucleotides (termed T2-B).
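
The stated lengths and primer binding sites can be checked directly from the
sequences. The sketch below is only a consistency check written for illustration; the
helper function is hypothetical, and R1 is the second primer named in the PCR
analysis of this example.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


T1 = "GAGCGGATGCGTACATCTTGTACATGTCAATGCGATCGATCGACTGATCAAGATAGACCGAGTGCCG"
T2 = "GAGCGGATGCGTACATCTACGATGGATGCTCCAGGTCGCAAGATAGACCGAGTGCCG"
F1 = "CGGCACTCGGTCTATCTT"
R1 = "GAGCGGATGCGTACATCT"

lengths = (len(T1), len(T2))
# F1 anneals to the 3' end of each identifier, so its reverse complement
# must be the last 18 nucleotides of T1 and T2.
f1_binds = [t.endswith(reverse_complement(F1)) for t in (T1, T2)]
# R1 is identical to the 5' end of each identifier.
r1_matches = [t.startswith(R1) for t in (T1, T2)]
```

Both identifiers share the same primer sites and differ only in the internal region,
so a single primer pair amplifies both and the products are told apart by length.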

Enrichment of the biotin tagged T1+B complex was performed in Reacti-Bind™
streptavidin coated clear strip plates (Pierce Biotechnology, Cat # 15120). The two
pools of double stranded DNA, T1+B and T2-B, were mixed in the following ratios:
1 pmol of T1+B complex and 1 pmol of T2-B complex; 1/10 pmol of T1+B complex and
1 pmol of T2-B complex; 1/100 pmol of T1+B complex and 1 pmol of T2-B complex;
1/1,000 pmol of T1+B complex and 1 pmol of T2-B complex; 1/10,000 pmol of T1+B
complex and 1 pmol of T2-B complex; 1/100,000 pmol of T1+B complex and 1 pmol of
T2-B complex; and 1/1,000,000 pmol of T1+B complex and 1 pmol of T2-B complex.
All were mixed in 100 µL buffer A (Tris buffered saline, 0.05% Tween 20, 1% bovine
serum albumin, 0.1 mg/mL herring sperm DNA). Incubation in streptavidin coated wells
was done for 30 min at 25°C. After ligand binding, wells were washed 30 times with
250 µL buffer A during one hour. Thereafter 100 µL buffer A was applied to each well
and the wells were exposed to UV light at 350 nanometers for 30 seconds in order to
cleave the PC spacer, thereby releasing the T1 identifier from the biotin molecule.
Following exposure to UV light the elution volume was removed immediately and
analyzed for the presence of DNA strands T1 and T2 by polymerase chain reaction
(PCR).
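
To put the dilution series into absolute numbers, the following sketch converts the
picomole amounts into approximate molecule counts with the Avogadro constant. It is
an arithmetic illustration only; the function and variable names are hypothetical.

```python
AVOGADRO = 6.02214076e23  # molecules per mole


def molecules_from_pmol(pmol):
    """Convert an amount in picomoles to a number of molecules."""
    return pmol * 1e-12 * AVOGADRO


# Each entry: (fold excess of T2-B over T1+B, molecules of T1+B in the mixture).
# T2-B is kept at 1 pmol while T1+B is diluted from 1 pmol to 1/1,000,000 pmol.
series = []
for exponent in range(7):
    fold = 10 ** exponent
    series.append((fold, molecules_from_pmol(1 / fold)))

t2_molecules = molecules_from_pmol(1)
```

In the most dilute mixture roughly 6 x 10^5 copies of T1+B are present among roughly
6 x 10^11 copies of T2-B, i.e. a one million-fold excess of the non-binding
complex.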

The content of T1 and T2 following enrichment for biotin binding molecules was
analysed by generating 32 P phosphate labeled PCR products of T1 and T2. Both T1
and T2 PCR products were generated by the use of the two primers F1
(CGGCACTCGGTCTATCTT) and R1 (GAGCGGATGCGTACATCT). R1 was labeled with 32 P phosphate
by the use of T4 polynucleotide kinase from Promega (Cat# M4101). PCR was performed
by adding 10 µL of the eluted volume together with 15 µL of water containing 1 pmol
of both F1 and 32 P-labeled R1 to puReTaq™ Ready-To-Go™ PCR Beads (Amersham
Biosciences, Cat# 407513-96). PCR was performed for 30 cycles by annealing at 58°C
and extending at 72°C. PCR generated products were separated by urea containing
polyacrylamide gel electrophoresis and visualized by radiography.

Eight wells precoated with streptavidin were used for the enrichment experiment
shown in Fig. 1A. Lane 1 in Fig. 1A represents PCR amplification on eluate from a
well that was incubated with Buffer A without any identifier complexes. Lane 2 was
incubated with Buffer A containing 1 pmol of T1+B complex and 1 pmol of T2-B
complex. Lane 3 was incubated with Buffer A containing 1/10 pmol of T1+B complex
and 1 pmol of T2-B complex. Lane 4 was incubated with Buffer A containing 1/100
pmol of T1+B complex and 1 pmol of T2-B complex. Lane 5 was incubated with
Buffer A containing 1/1,000 pmol of T1+B complex and 1 pmol of T2-B complex.
Lane 6 was incubated with Buffer A containing 1/10,000 pmol of T1+B complex and
1 pmol of T2-B complex. Lane 7 was incubated with Buffer A containing 1/100,000
pmol of T1+B complex and 1 pmol of T2-B complex. Lane 8 was incubated with
Buffer A containing 1/1,000,000 pmol of T1+B complex and 1 pmol of T2-B complex.

Lane 9 in Fig. 1A represents PCR amplification of the input into well 2. Lane 10
represents PCR amplification of the input into well 3. Lane 11 represents PCR
amplification of the input into well 4. Lane 12 represents PCR amplification of the
input into well 5. Lane 13 represents PCR amplification of the input into well 6.
Lane 14 represents PCR amplification of the input into well 7. Lane 15 represents
PCR amplification of the input into well 8. Lane 16 represents PCR amplification of
water (Blank).

In Fig. 1B the PCR products represented in Fig. 1A were subjected to an additional
30 cycles of PCR amplification. This revealed T1 and T2 identifiers of very low
abundance. Lane 1 represents further PCR amplification of PCR product as represented
in Fig. 1B lane 1. Lane 2 represents further PCR amplification of PCR product as
represented in Fig. 1B lane 2. Lane 3 represents further PCR amplification of PCR
product as represented in Fig. 1B lane 3. Lane 4 represents further PCR amplification
of PCR product as represented in Fig. 1B lane 4. Lane 5 represents further PCR
amplification of PCR product as represented in Fig. 1B lane 5. Lane 6 represents
further PCR amplification of PCR product as represented in Fig. 1B lane 6. Lane 7
represents further PCR amplification of PCR product as represented in Fig. 1B lane 7.
Lane 8 represents further PCR amplification of PCR product as represented in Fig.
1B lane 8.

Lane 9 of Fig. 1B represents PCR amplification of water (Blank). Lane 10 represents
PCR amplification of the input into well 2 of Fig. 1B. Lane 11 represents PCR
amplification of the input into well 3 of Fig. 1B. Lane 12 represents PCR amplification
of the input into well 4 of Fig. 1B. Lane 13 represents PCR amplification of the input
into well 5 of Fig. 1B. Lane 14 represents PCR amplification of the input into well 6
of Fig. 1B. Lane 15 represents PCR amplification of the input into well 7 of Fig. 1B.
Lane 16 represents PCR amplification of the input into well 8 of Fig. 1B. Lane 17
represents PCR amplification of water (Blank).

From Fig. 1A lane 6 it can be inferred that the double stranded DNA complex T1+B
was enriched more than ten thousand fold over the T2-B complex. Further enrichment
cannot be deduced from Fig. 1A since the identifiers recovered and analyzed by PCR
in lanes 7 and 8 were too few to produce detectable bands after 30 rounds of PCR.
After the additional 30 rounds of PCR shown in Fig. 1B, the T2 product, in addition
to the T1 product, appears in Fig. 1B lanes 5, 6, 7 and 8. The T2 product becomes the
dominating PCR product in lane 8, suggesting that enrichment of the T1+B complex
over the T2-B complex is between one hundred thousand and one million fold.
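
The bracketing argument above can be written out explicitly. If fold enrichment is defined as the T1:T2 ratio after selection divided by the T1:T2 ratio in the input, then a lane in which T1 still dominates implies an enrichment above the inverse of the input ratio, and a lane in which T2 dominates implies an enrichment below it. The sketch below encodes that reasoning; the function name is hypothetical, and the entry for lane 7 is an assumption read from the statement that T2 only becomes dominant in lane 8.

```python
from fractions import Fraction


def enrichment_bounds(observations):
    """Return (lower, upper) bounds on fold enrichment.

    observations: iterable of (input T1:T2 ratio, dominant product),
    where the dominant product is "T1" or "T2". A bound is None when
    no observation constrains that side.
    """
    lower = upper = None
    for input_ratio, dominant in observations:
        threshold = 1 / input_ratio  # enrichment at which T1 and T2 would be equal
        if dominant == "T1":
            lower = threshold if lower is None else max(lower, threshold)
        elif dominant == "T2":
            upper = threshold if upper is None else min(upper, threshold)
        else:
            raise ValueError(f"unknown product: {dominant!r}")
    return lower, upper


observations = [
    (Fraction(1, 10**4), "T1"),  # lane 6, as stated for Fig. 1A
    (Fraction(1, 10**5), "T1"),  # lane 7, assumed: T2 not yet dominant
    (Fraction(1, 10**6), "T2"),  # lane 8, T2 is the dominating product
]

lower, upper = enrichment_bounds(observations)
print(f"fold enrichment between {lower} and {upper}")
```

Band dominance on a gel is only a coarse readout, so these bounds are order-of-magnitude estimates rather than measured values.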


Example 2: Enrichment of di-nitro-phenol tagged DNA identifier.
100 pico mol of a DNA identifier, T1
(GAGCGGATGCGTACATCTTGTACATGTCAATGCGATCGATCGACTGATCAAGATAGACCGAGTGCCG),
was annealed to 50 pico mol of a di-nitrophenol (DNP) tagged primer F1-PC-DNP
(DNP-PC-CGGCACTCGGTCTATCTT). PC is a photo cleavable group obtainable as a
phosphoramidite from Glen Research (Products cat# 10-4913). Subsequently the
F1-PC-DNP primer was extended by DNA polymerase (Sequenase from Upstate
Biotechnology Cat# 70775Y). Likewise 50 pico mol of a DNA identifier T2
(GAGCGGATGCGTACATCTACGATGGATGCTCCAGGTCGCAAGATAGACCGAGTGCCG)
was annealed to 100 pico mol of a primer F1 (CGGCACTCGGTCTATCTT) and extended
by DNA polymerase. This generated two pools of double stranded DNA sequences
each containing 50 pico mol of the following complexes, one of length 67
nucleotides termed T1+DNP and one of length 57 nucleotides termed T2-DNP.
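
The stated identifier lengths and primer sites can be verified directly from the sequences. The following sketch is an illustrative check only; the helper names are hypothetical. It confirms that each identifier begins with the R1 sequence and ends with the reverse complement of F1, which is where F1 (or its DNP tagged variant) anneals before extension.

```python
T1 = ("GAGCGGATGCGTACATCTTGTACATGTCA"
      "ATGCGATCGATCGACTGATCAAGATAGACCGAGTGCCG")
T2 = ("GAGCGGATGCGTACATCTACGATGGATGCTCCAGGTCGCAAGATAGAC"
      "CGAGTGCCG")
F1 = "CGGCACTCGGTCTATCTT"
R1 = "GAGCGGATGCGTACATCT"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def check_identifier(template: str) -> dict:
    """Report the length and primer sites of an identifier template."""
    return {
        "length": len(template),
        "starts_with_R1": template.startswith(R1),
        "ends_with_F1_site": template.endswith(reverse_complement(F1)),
    }


for name, seq in (("T1", T1), ("T2", T2)):
    print(name, check_identifier(seq))
```

The ten nucleotide difference between the two identifiers is what allows their PCR products to be told apart on a denaturing polyacrylamide gel.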


Enrichment of the DNP tagged T1+DNP complex was performed by coating 1 µg
anti DNP antibody in Nunc Immunomodule U8 Maxisorp (Biotecline cat # nuh-47507).
The two pools of double stranded DNA, T1+DNP and T2-DNP, were mixed
in ratios of 1 pmol of T1+DNP complex and 1 pmol of T2-DNP complex, 1/10 pmol of
T1+DNP complex and 1 pmol of T2-DNP complex, 1/100 pmol of T1+DNP complex
and 1 pmol of T2-DNP complex, 1/1,000 pmol of T1+DNP complex and 1 pmol of
T2-DNP complex, 1/10,000 pmol of T1+DNP complex and 1 pmol of T2-DNP complex,
1/100,000 pmol of T1+DNP complex and 1 pmol of T2-DNP complex and
1/1,000,000 pmol of T1+DNP complex and 1 pmol of T2-DNP complex. All were
mixed in 100 µl buffer A (Tris buffered saline, 0.05% Tween 20, 1% Bovine serum



